Unit A1
Sets, functions and vectors 


Introduction 


Introduction to Book A 


M208 covers a wide range of pure mathematics, and each book apart from 
this one concentrates on one topic. This book is different, because it covers 
the main concepts that underlie the topics in the other books. 


In Unit A1 you will review some of the important foundations of pure
mathematics and the mathematical language used to describe them. You 
will start with the plane, and revise ideas relating to points, lines and 
circles. You will then study in detail the mathematical ideas of a set 
(mostly of numbers or of points in the plane), and a function, including 
functions of real numbers and functions of points in the plane. Finally, you 
will consider vectors in the plane and in three-dimensional space. 


In Unit A2 you will look at number systems and their properties. You will 
first consider real numbers, and sets of real numbers, such as the integers 
and the rational numbers, then study complex numbers, investigate their 
properties, and look at some functions of complex numbers. Finally, you 
will study modular arithmetic, which provides examples of finite number 
systems. 


In Unit A3 you will concentrate on mathematical language and 
communication. You will study the important subject of mathematical 
proof, including the use of different methods of proof, and how to disprove 
a statement by finding a counterexample. You will also consider errors in 
mathematical arguments including errors in deduction. Finally, you will 
study equivalence relations and the idea of a partition of a set. 


In Unit A4 you will concentrate on real functions, and on how to draw 
their graphs. You will review the graphs of various common functions, and 
consider a wide range of functions and their properties, including 
trigonometric and hyperbolic functions. Finally, you will consider curves 
that are not the graphs of real functions including conics (circles, 
parabolas, hyperbolas and ellipses) and see that they can be described in 
terms of a single parameter. 


Introduction 


In this unit you will look at some of the most fundamental mathematical 
concepts underlying pure mathematics. Many of these concepts should not 
be new to you, but working through this unit should ensure that you 
understand them to the level needed for M208. 


Sections 1 to 3 contain basic material that will be crucial throughout the 
module. It is vital that you become familiar and confident with the ideas 
and notation introduced in these sections. Section 4 revises concepts that 
will be used later in the module, in particular in Book C Linear algebra. 




Figure 1 The real line 


Figure 2 Cartesian coordinates


1 Points, lines and distance 


In this section you will revise points, lines and distance in two- and 
three-dimensional space. 


1.1 The plane 


The set of all real numbers is denoted by R, and this set can be pictured as 
an infinitely long number line, often called the real line, as shown in 
Figure 1. Each real number a corresponds to a point on the line. 


In this subsection we consider the plane, or two-dimensional space. To 
allow us to specify the locations of points in the plane, we usually use a 
pair of perpendicular axes, known as Cartesian or rectangular axes. We 
usually label the axes x and y; we refer to their intersection point as the 
origin and sometimes label it O. Finally, we choose a unit of distance. 
The location of any point in the plane can be specified by using an 
ordered pair (a,b) of real numbers, known as Cartesian coordinates or 
just coordinates, that give the position of the point relative to the axes, 
as shown in Figure 2. (An ordered pair is a pair in which order matters;
for example, the ordered pair (2,3) is different from the ordered pair
(3,2).) We write A(a,b) to specify the point A with coordinates (a,b).

It is important to understand that the coordinates of a point depend on 
where the axes have been placed in the plane; if we had chosen the axes to 
be in a different position, then usually the coordinates of the point would 
be different. However, once we have chosen the position of the axes, we 
often do not bother to distinguish explicitly between a point and its 
representation using these coordinates: we simply write (a,b) to denote the 
point A. 


We use the notation $\mathbb{R}^2$ to denote the plane.


The adjective Cartesian comes from the surname of the French 
mathematician and philosopher René Descartes (1596-1650). He was 
the first person to show in print how algebra could be used to study 
geometry, in his 1637 publication La géométrie. Descartes’ procedure 
differed from the system of Cartesian coordinates that we use today. 
His axes were not necessarily at right angles, and could be chosen in 
relation to the circumstances of the problem rather than being given 
in advance. 


The plane, together with an origin O and a pair of x- and y-axes, is 
known as two-dimensional Euclidean space. 


Euclidean space is named after the Greek mathematician Euclid. 
Little is known for certain about Euclid but he is believed to have 
worked in Alexandria in around 300 BCE. 


Euclid’s Elements, a mathematical treatise of thirteen books which 
had its origins on papyrus rolls, has become one of the most frequently 
printed texts of all time. Although Elements covers both plane and 
solid Euclidean geometry, Euclid had no notion of axes or coordinates. 


Lines 
The equation of any straight line in $\mathbb{R}^2$, except a line parallel to the y-axis,
can be written in the form
$y = mx + c$,
where $m, c \in \mathbb{R}$.
In this equation:
• m is the gradient (or slope) of the line, given by
$m = \dfrac{y_2 - y_1}{x_2 - x_1}$, (1)
where $(x_1, y_1)$ and $(x_2, y_2)$ are any two points on the line such that
$x_1 \neq x_2$
• c is the y-intercept of the line; that is, (0, c) is the point at which the
line crosses the y-axis, as illustrated in Figure 3(a).


The line with gradient m that crosses the y-axis at the origin has equation
$y = mx$, since c = 0 in this case; see Figure 3(b). The horizontal line
(parallel to the x-axis) with y-intercept c has equation y = c, since the 
gradient m = 0 in this case; see Figure 3(c). 


The vertical line (parallel to the y-axis) with x-intercept a has
equation $x = a$; see Figure 3(d). Any two points on such a line have the
same x-coordinate, so formula (1) does not give a gradient for it. The
gradient is undefined, and the equation of the line cannot be written in
the form $y = mx + c$.


Figure 3 Lines in the plane: (a) y = mx + c, (b) y = mx, (c) y = c, (d) x = a


In all of the cases above, the equation of the line in the plane can be 
rearranged to take the form 


$ax + by = c$, (2)
for some real numbers a, b and c, where a and b are not both zero. (Note
that the numbers a and c here are not the same as those called a and c in
Figure 3.)


Figure 4 Parallel and perpendicular lines

In fact, any line in $\mathbb{R}^2$ has an equation of the form (2) and, conversely, any
equation of the form (2) represents a line in $\mathbb{R}^2$.


Equation of a line 


The general equation of a line in $\mathbb{R}^2$ is
$ax + by = c$,


where a, b and c are real numbers, and a and b are not both zero. 


From formula (1) for the gradient of a line, we can see that the equation of
the line with gradient m that passes through the point $(x_1, y_1)$ is


$y - y_1 = m(x - x_1)$.
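

The point-gradient form above can be checked numerically. The following Python sketch is an illustration only and is not part of the module text: the function name line_through is hypothetical, and it assumes integer coordinates so that the gradient can be held exactly as a fraction. It returns coefficients (a, b, c) of the general form ax + by = c, and treats a vertical line separately because formula (1) does not apply to it.

```python
from fractions import Fraction


def line_through(p, q):
    """Return (a, b, c) such that a*x + b*y = c is the line through p and q.

    Illustrative helper: p and q are distinct points with integer coordinates.
    """
    (x1, y1), (x2, y2) = p, q
    if (x1, y1) == (x2, y2):
        raise ValueError("the two points must be distinct")
    if x1 == x2:
        # Vertical line x = x1: the gradient is undefined.
        return (1, 0, x1)
    m = Fraction(y2 - y1, x2 - x1)  # gradient, formula (1)
    # y - y1 = m(x - x1) rearranges to -m*x + y = y1 - m*x1.
    return (-m, 1, y1 - m * x1)


print(line_through((1, 2), (3, -4)))
print(line_through((2, 0), (2, 7)))
```

For the points (1, 2) and (3, -4) the gradient is -3, so the sketch returns the coefficients of 3x + y = 5.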


Exercise A1


Determine the equation of the line with gradient $-3$ that passes through
the point $(2, -1)$.


Exercise A2 


Determine the equation of the line through each of the following pairs of 
points. 


(a) (1,1) and (3,5) (b) (0,0) and (0,8) (c) (0,0) and (4,2)
(d) $(4, -1)$ and $(2, -1)$


Parallel and perpendicular lines 


Two distinct lines are parallel if they never meet, and perpendicular if 
they meet at right angles. 


Saying that two non-vertical lines are parallel is equivalent to saying that
they have the same gradient but different y-intercepts. For example, as
shown in Figure 4, the lines $y = -2x + 7$ and $y = -2x - 3$ are parallel
since they both have gradient $-2$ but their y-intercepts are 7 and $-3$,
respectively, whereas the lines $y = -2x + 7$ and $y = 2x - 3$ are not parallel
since their gradients $-2$ and 2 are not equal.


We can also use the gradients of a pair of non-vertical lines to check 
whether they are perpendicular, as follows. 


Gradients of perpendicular lines 
Let $l_1$ and $l_2$ be lines with gradients $m_1$ and $m_2$, respectively.
• If $l_1$ and $l_2$ are perpendicular, then $m_1 m_2 = -1$.
• If $m_1 m_2 = -1$, then $l_1$ and $l_2$ are perpendicular.


To see that the first statement in the box is true, suppose that the lines $l_1$
and $l_2$ are perpendicular and that neither line is vertical. Let the gradients
of $l_1$ and $l_2$ be $m_1$ and $m_2$, respectively. Then one of the lines ($l_1$, say)
must slope up from left to right and the other ($l_2$, say) must slope down
from left to right, as shown in Figure 5.


Figure 5 Perpendicular lines 


Let the lines intersect at P, and let Q be a point on $l_1$ to the right of P.
Suppose that Q is a units to the right of P and b units up from P, as
illustrated in Figure 5. Let R be the point on $l_2$ obtained by rotating PQ
anticlockwise through a right angle; then R is b units to the left of P and a
units up from P, as shown.


It follows that the gradient of $l_1$ is $m_1 = b/a$, and the gradient of $l_2$ is
$m_2 = -a/b$. Hence


$m_1 m_2 = \dfrac{b}{a} \times \left(-\dfrac{a}{b}\right) = -1$.


The proof of the second statement in the box above is not given here. 
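

The gradient tests above can be sketched in code. The following Python function is a hypothetical illustration, not part of the module text. It takes each line in the general form ax + by = c as a tuple (a, b, c). For non-vertical lines the gradient is m = -a/b, so equal gradients correspond to $a_1 b_2 = a_2 b_1$, and $m_1 m_2 = -1$ corresponds to $a_1 a_2 + b_1 b_2 = 0$. Writing the tests this way avoids division, so the sketch also accepts vertical lines, which goes slightly beyond the box above.

```python
def relation(line1, line2):
    """Classify two lines given as coefficient tuples (a, b, c) of a*x + b*y = c."""
    a1, b1, c1 = line1
    a2, b2, c2 = line2
    if a1 * b2 == a2 * b1:
        # Same direction: either the same line or two parallel lines.
        same_line = a1 * c2 == a2 * c1 and b1 * c2 == b2 * c1
        return "identical" if same_line else "parallel"
    if a1 * a2 + b1 * b2 == 0:
        return "perpendicular"
    return "neither"


# y = -2x + 7 and y = -2x - 3, written as 2x + y = 7 and 2x + y = -3
print(relation((2, 1, 7), (2, 1, -3)))
# y = -2x + 7 and y = 2x - 3
print(relation((2, 1, 7), (-2, 1, -3)))
# y = -2x + 7 and y = x/2, written as x - 2y = 0
print(relation((2, 1, 7), (1, -2, 0)))
```

The first two calls reproduce the example from Figure 4; the third uses gradients -2 and 1/2, whose product is -1.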


Worked Exercise A1 


Determine which of the following lines are parallel, and which are 
perpendicular to each other. 
$l_1$: $y = -2x + 4$   $l_2$: $2x - 3y - 2 = 0$   $l_3$: $y - 2x = 9$
$l_4$: $2y + 3x + 5 = 0$   $l_5$: $x + \tfrac{1}{2}y + 2 = 0$   $l_6$: $2y = 3x + 7$




Exercise A3 
Determine which of the following lines are parallel, and which are 
perpendicular to each other. 


$l_1$: $y = -2x + 4$   $l_2$: $6x - 3y + 4 = 0$   $l_3$: $2y + x = 10$
$l_4$: $6y - 3x + 5 = 0$   $l_5$: $x - 2y + 2 = 0$   $l_6$: $2y + 4x + 7 = 0$


Distance between two points in the plane 


Next, we find the formula for the distance between any two points in the 
plane. 


We use the idea of the modulus of a real number k, written |k| and
defined by


$|k| = \begin{cases} k, & \text{if } k \ge 0, \\ -k, & \text{if } k < 0. \end{cases}$


(The modulus of k, usually read as ‘mod k’, is sometimes called the
absolute value or magnitude of k.)


Suppose that P(x1,y1) and Q(x2, y2) are two points in the plane, as shown 
in Figure 6. We can construct a right-angled triangle PNQ as shown: the 
line PN is parallel to the x-axis, the line QN is parallel to the y-axis, the
angle PNQ is a right angle, and PQ is the hypotenuse of the triangle. In 
Figure 6, P and Q are drawn in the first quadrant and with PQ sloping up 
from left to right, but the formula holds wherever the points are in the 
plane. 


Figure 6 Distance between P and Q in the plane


The length of PN is $|x_2 - x_1|$ and the length of QN is $|y_2 - y_1|$. It follows
from Pythagoras’ Theorem that


$PQ^2 = PN^2 + QN^2$,
and since $|k|^2 = k^2$ for any real number k, we have


$PQ = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$.


Distance formula for $\mathbb{R}^2$


The distance between the two points $(x_1, y_1)$ and $(x_2, y_2)$ in the plane
is


$\sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$.


For example, it follows from the formula above that the distance between
the points (1,2) and $(3, -4)$ is


$\sqrt{(3 - 1)^2 + (-4 - 2)^2} = \sqrt{2^2 + (-6)^2}$
$= \sqrt{40} = \sqrt{4 \times 10}$
$= \sqrt{4}\,\sqrt{10} = 2\sqrt{10}$.
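

As a quick numerical check of the distance formula, here is a minimal Python sketch. The function name distance_2d is hypothetical and chosen only for this illustration; points are assumed to be pairs of numbers.

```python
import math


def distance_2d(p, q):
    """Distance between the points p = (x1, y1) and q = (x2, y2) in the plane."""
    (x1, y1), (x2, y2) = p, q
    return math.sqrt((x2 - x1) ** 2 + (y2 - y1) ** 2)


d = distance_2d((1, 2), (3, -4))
print(d)
# The value agrees with 2 * sqrt(10) up to floating-point rounding.
print(math.isclose(d, 2 * math.sqrt(10)))
```

Because the differences are squared, the order of the two points does not matter, which matches the remark that $|k|^2 = k^2$.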


Exercise A4 
Find the distances between the following pairs of points in the plane. 
(a) (0,0) and (5,0) (b) (0,0) and (3,4) (c) (1,2) and (5,1)
(d) $(3, -8)$ and $(-1, 4)$


Circles 


A circle in $\mathbb{R}^2$, as illustrated in Figure 7, is the set of points P(x,y) that
lie at a fixed distance r, called the radius, from a fixed point C(a,b),
called the centre of the circle.


Figure 7 A circle with radius r and centre (a, b)


By the distance formula, every point (x,y) on the circle with centre (a,b)
and radius r satisfies the equation


$\sqrt{(x - a)^2 + (y - b)^2} = r$.


Squaring this equation to remove the square root gives the following. 


Equation of a circle 


The equation of the circle in $\mathbb{R}^2$ with centre (a,b) and radius r is
$(x - a)^2 + (y - b)^2 = r^2$.




In this unit we will just work with equations of circles in this form, 
without multiplying out the brackets. In Unit A4 Real functions, graphs 
and conics, you will see how multiplying out the brackets leads to other 
forms for the equations of circles. 
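

The equation of a circle gives a direct test for whether a point lies on it. The Python sketch below is illustrative only: the function name on_circle is hypothetical, and the comparison uses math.isclose because a radius such as $\sqrt{3}$ cannot be stored exactly as a floating-point number.

```python
import math


def on_circle(point, centre, radius):
    """Return True if point satisfies (x - a)^2 + (y - b)^2 = r^2."""
    (x, y), (a, b) = point, centre
    return math.isclose((x - a) ** 2 + (y - b) ** 2, radius ** 2)


# A circle with centre (1, 2) and radius 5 (values chosen for illustration).
print(on_circle((4, 6), (1, 2), 5))   # 3^2 + 4^2 = 25
print(on_circle((1, 8), (1, 2), 5))   # 0^2 + 6^2 = 36
```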


Worked Exercise A2 


Find the equation of the circle with centre $(-1, 2)$ and radius $\sqrt{3}$.


Exercise A5 


Determine the equation of each of the following circles, given the centre 
and radius. 


(a) Centre the origin, radius 4. 
(b) Centre $(-1, 0)$, radius $\sqrt{2}$.
(c) Centre $(3, -4)$, radius 2.


1.2 Three-dimensional space 
We now look briefly at three-dimensional space. 


We define a coordinate system in three-dimensional space using three 
mutually perpendicular axes. The word mutually here means that the 
condition holds for any pair, so mutually perpendicular means that any 
two of the axes are perpendicular. 


First, we choose a point O as the origin, and then we choose an x-axis and 
a y-axis at right angles to each other. Next, we draw a third line through 
the origin, perpendicular both to the x-axis and to the y-axis; this line is 
called the z-axis. We choose the positive direction of the z-axis to be such 
that the x-, y- and z-axes form a so-called right-handed system of axes. 
This means that if you hold the thumb and first and second fingers of your 
right hand at right angles to each other, and label them x, y and z, in that 
order, then you can turn your hand in such a way that your fingers point 
in the positive directions of the corresponding axes, as shown in Figure 8. 


Figure 8 A right-handed system of coordinate axes for $\mathbb{R}^3$


Finally, we choose a unit of distance. 


We represent each point in three-dimensional space by an ordered triple 
(a,b,c) of real numbers. The point with coordinates (a, b,c) is reached 
from the origin by moving a distance a in the direction of the x-axis, a 
distance b in the direction of the y-axis, and a distance c in the direction of 
the z-axis, as illustrated in Figure 9(a). 


For instance, the point with coordinates $(-3, -2, 4)$ is shown in
Figure 9(b).


Figure 9 Three-dimensional Cartesian coordinates 


In Figure 9, the plane containing the x-axis and the y-axis is shaded.
Usually we think of this plane as being horizontal, and the z-axis as being 
vertical. 


We use the notation $\mathbb{R}^3$ to denote three-dimensional space.


Exercise A6


Sketch the x-, y- and z-axes and the points with coordinates (0,1,2) and
(1,91).




As with $\mathbb{R}^2$, once we have chosen the position of the axes, we often do not
bother to distinguish explicitly between a point and its representation
using these coordinates; we simply write (a,b,c) to denote the point in $\mathbb{R}^3$
represented by this triple.

Three-dimensional space, together with an origin and a set of x-, y- and 
z-axes, is known as three-dimensional Euclidean space. 


Distance between points in $\mathbb{R}^3$


You saw in Subsection 1.1 that the distance between two points $(x_1, y_1)$
and $(x_2, y_2)$ in the plane is given by


$\sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2}$.


We can establish a similar formula for the distance between two points
in $\mathbb{R}^3$, as follows.


Let $P(x_1, y_1, z_1)$ and $Q(x_2, y_2, z_2)$ be two points in $\mathbb{R}^3$. Let M be the point
$(x_2, y_2, z_1)$; then M lies in the same horizontal plane as P, and MQ is
parallel to the z-axis. Next, let N be the point $(x_1, y_2, z_1)$; then N also lies
in the same horizontal plane as P, and MN and NP are parallel to the x-
and y-axes, respectively.


The triangles PQM and PMN are both right-angled triangles, with right 
angles at M and N, respectively, as shown in Figure 10. 


Figure 10 Distance between P and Q in $\mathbb{R}^3$


The length of PN is $|y_2 - y_1|$ and the length of NM is $|x_2 - x_1|$. It follows
from Pythagoras’ Theorem that
$PM^2 = NM^2 + PN^2$,
so
$PM^2 = (x_2 - x_1)^2 + (y_2 - y_1)^2$.
Using Pythagoras’ Theorem again gives
$PQ^2 = PM^2 + MQ^2$,
and since the length of MQ is $|z_2 - z_1|$ we obtain
$PQ^2 = (x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2$,
that is,


$PQ = \sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2}$.


Distance formula for $\mathbb{R}^3$
The distance between the two points $(x_1, y_1, z_1)$ and $(x_2, y_2, z_2)$ in $\mathbb{R}^3$
is


$\sqrt{(x_2 - x_1)^2 + (y_2 - y_1)^2 + (z_2 - z_1)^2}$.


For example, it follows from this formula that the distance between the
points (1,2,3) and $(4, -2, 15)$ is


$\sqrt{(4 - 1)^2 + (-2 - 2)^2 + (15 - 3)^2} = \sqrt{169} = 13$.
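

The two distance formulas have the same shape: square the coordinate differences, add them, and take the square root. The Python sketch below makes that pattern explicit for points with any number of coordinates. The function name distance is hypothetical; the cross-check uses math.dist from the standard library (Python 3.8 or later).

```python
import math


def distance(p, q):
    """Distance between two points given as coordinate tuples of equal length."""
    if len(p) != len(q):
        raise ValueError("points must have the same number of coordinates")
    return math.sqrt(sum((b - a) ** 2 for a, b in zip(p, q)))


p, q = (1, 2, 3), (4, -2, 15)
print(distance(p, q))
# Cross-check against the standard-library implementation.
print(math.isclose(distance(p, q), math.dist(p, q)))
```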


Exercise A7 


Find the distances between the following pairs of points in $\mathbb{R}^3$.
(a) (1,1,1) and $(4, 1, -3)$ (b) (1,2,3) and (3,0,3)


We will return to the topic of three-dimensional space in Section 4, where
we will consider vectors in $\mathbb{R}^2$ as well as in $\mathbb{R}^3$, and find the general
equation of a plane in $\mathbb{R}^3$.


2 Sets 


In this section you will revise the notion of sets, learn new notation for 
describing sets, and practise working with sets and set notation. These 
skills will be crucial in the rest of the module. 


2.1 What is a set? 


In mathematics we frequently work with collections of objects of various 
kinds. We may, for example, consider the following: 


• solutions of a quadratic equation
• points on a circle
• vertices of a triangle
• points on a plane in $\mathbb{R}^3$
• even numbers less than 100
• students taking a particular examination.


The concept of a set allows us to work with such collections systematically. 


You can think of a set as a collection of objects, such as numbers, points, 
functions, or even a collection of other sets. Each object in a set is an 
element or member of the set, and the elements belong to the set, or are 
in the set. 


Figure 11 A Venn diagram of the set S


There is no restriction on the types of object that may appear in a set, 
provided that the set is specified in a way that enables us to decide, in 
principle, whether a given object is in the set. 


There are many ways of making such a specification. For example, we can 
define S to be the set of numbers in the list 
4, 9, 3, 2. 


This enables us to decide that the number 2 (say) is in S, but that the 
number 1 (say) is not in S. We can illustrate this set by a diagram, as in 
Figure 11, where the symbol S is not a member of the set but a label for 
it. (Similar labels will appear in other diagrams.) Such a diagram is called 
a Venn diagram, after the nineteenth-century Cambridge mathematician 
John Venn. 


We can also define a set by describing its elements; for example, 
let E be the set of all even integers. 


This description enables us to determine whether a given object is in E by 
deciding whether it is an even integer; for example, 6 is in E, but 5 is not.


Some sets are used so often that special symbols are reserved for them. 
Recall that a real number is a number with a decimal expansion (possibly 
infinite), for example, 1.1 or $\pi = 3.14\ldots$, and a rational number is a real
number that can be expressed as a fraction, for example, 14/5 or —3/4. 
You will revise these sets more thoroughly in Unit A2 Number systems. 
We use the following notation, some of which you met in Section 1. 


$\mathbb{R}$ denotes the set of real numbers.
$\mathbb{R}^*$ denotes the set of non-zero real numbers.
$\mathbb{Q}$ denotes the set of rational numbers.
$\mathbb{Z}$ denotes the set of integers $\ldots, -2, -1, 0, 1, 2, \ldots$.
$\mathbb{N}$ denotes the set of natural numbers $1, 2, 3, \ldots$.


A finite set is a set that has a finite number of elements; that is, the 
number of elements is some natural number, or 0. Any set that is not a 
finite set is an infinite set. 


We use the symbol $\in$ to indicate membership of a set; for example, we
indicate that 7 is a member of $\mathbb{N}$ by writing


$7 \in \mathbb{N}$. (This is usually read as ‘7 belongs to $\mathbb{N}$’ or ‘7 is in $\mathbb{N}$’.)
We indicate that $-9$ is not a member of $\mathbb{N}$ by writing
$-9 \notin \mathbb{N}$. (‘$-9$ does not belong to $\mathbb{N}$’ or ‘$-9$ is not in $\mathbb{N}$’.)


We also use the symbol $\in$ when we wish to introduce a symbol that stands
for an arbitrary (that is, general, unspecified) element of a set. For
example, to indicate that x is a real variable, that is, an arbitrary
member of the set $\mathbb{R}$, we write


let $x \in \mathbb{R}$.


We often write $x_1, x_2 \in S$ as shorthand to combine $x_1 \in S$ and $x_2 \in S$.


Exercise A8 


Which of the following statements are true? 
(a) -3E€Z b) 5¢N (c) 13¢Q (da) 1,3€Q 
(e) -reER (f) EN (g) O01ER* (h) V2ER 


2.2 Set notation 


We now look at some formal ways of specifying a set. 


We can specify a set with a small number of elements by listing these 
elements between a pair of braces (curly brackets). For example, we can 
specify the set A consisting of the first five natural numbers, illustrated in 
Figure 12, by 

A = {1,2,3,4, 5}. 
Figure 12 The set A

The membership of a set is not affected by the order in which its elements
are listed, so we can specify this set A equally well by


A = {5,2,1,4,3}. 


Similarly, we can specify the set B of vertices of the square shown in
Figure 13 by


$B = \{(0,0), (1,0), (1,1), (0,1)\}$.


Figure 13 The set B


We can even specify a set C, illustrated in Figure 14, whose elements are
the three sets {1,3,5}, {9,4} and {2} by


$C = \{\{1, 3, 5\}, \{9, 4\}, \{2\}\}$.


A set with only one element, such as the set {2}, is called a singleton or a
singleton set. (Do not confuse the set {2}, which contains the number 2,
with the number 2 itself.)


Figure 14 The set C


Exercise A9


Which of the following statements are true?


(a) $1 \in \{4, 3, 1, 7\}$
(b) $\{-9\} \in \{\{6, 1, 2\}, \{8, 7, 9, 5\}, \{9\}, \{5, 4\}\}$
(c) $\{9\} \in \{5, 6, 7, 8, 9\}$
(d) $(0, 1) \in \{(1, 0), (1, 4), (2, 4)\}$
(e) $1, 0 \in \{(1, 0), (1, 4), (2, 4)\}$
(f) $\{1, 0\} \in \{\{0, 1\}, \{1, 4\}, \{2, 4\}\}$




It does not matter if we specify a set element more than once within set 
brackets. For example, 


{1,2,3,3} and {1,2,3} 
describe the same set. However, we usually try to avoid specifying an 
element more than once. 


For a set with a large number of elements, it is not practical to list all the 
elements, so we sometimes use three dots (called an ellipsis) to indicate 
that a particular pattern of membership continues. For example, we can 
specify the set consisting of the first 100 natural numbers by writing 


{1,2,3,... , 100}. 


The use of an ellipsis can be extended to certain infinite sets. For example, 
we can specify the set of all natural numbers by writing 


$\{1, 2, 3, \ldots\}$.


One disadvantage of this notation is that the pattern indicated by the 
ellipsis may be ambiguous. For example, it is not clear whether 


{3,5,7,...} 


denotes the set of odd prime numbers or the set of odd natural numbers 
greater than 1. For this reason, this notation can be used only when the 
pattern of membership is obvious, or where an additional clarifying 
explanation is given. 


An alternative way of specifying a set is to use variables to build up 
objects of the required type, and then write down the condition(s) that the 
variables must satisfy. For example, consider the set of all real numbers x 
such that x > 3. Using set notation, we write this as 


$\{x \in \mathbb{R} : x > 3\}$,


which is read as ‘the set of all real numbers x such that x > 3’, as shown in Figure 15.


Figure 15 How to ‘read’ set notation


A set can often be described in several different ways using such set 
notation. In particular, we can use a letter other than x to denote an 
arbitrary (general) element of a set; for example, the set above can also be 
written as 


$\{r \in \mathbb{R} : r > 3\}$.


If it is necessary to include more than one condition after the colon, then
we write either a comma or the word ‘and’ between the conditions. So the
set of real numbers greater than 0, and less than or equal to 1, can be
written as


$\{x \in \mathbb{R} : x > 0,\ x \le 1\}$ or $\{x \in \mathbb{R} : x > 0 \text{ and } x \le 1\}$,
although usually we combine the inequalities and write
$\{x \in \mathbb{R} : 0 < x \le 1\}$.


Sometimes it is convenient to specify a set by writing an expression in one 
or more variables before the colon, and the conditions on the variables 
after the colon. For example, the set of even integers less than 100 may be 
specified by 


$\{2k : k \in \mathbb{Z} \text{ and } k < 50\}$.


Just as when listing the elements of a set, it does not matter when using 
set notation if a set element is specified more than once. For example, 


$\{\sin x : x \in \mathbb{R}\}$
specifies the same set as


$\{\sin x : 0 \le x \le 2\pi\}$.
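

Set notation of this kind has a close counterpart in Python's set comprehensions, which can be a useful way to experiment with small examples. The sketch below is illustrative only. Python sets must be finite, so infinite sets such as the integers are replaced here by finite ranges; those ranges, and all the variable names, are assumptions made for the illustration.

```python
# Listing an element more than once does not change the set.
print({1, 2, 3, 3} == {1, 2, 3})

# {x in S : x > 3}, where S is the set of numbers in the list 4, 9, 3, 2.
S = {4, 9, 3, 2}
greater_than_3 = {x for x in S if x > 3}

# {2k : k in Z and k < 50}, restricted to k >= -5 so that the set is finite.
evens = {2 * k for k in range(-5, 50)}

# {k^2 : k in Z, -3 <= k <= 3}: each non-zero square is produced twice
# (for example by k = 2 and k = -2) but appears only once in the set.
squares = {k * k for k in range(-3, 4)}

print(sorted(greater_than_3))
print(max(evens))
print(sorted(squares))
```

The expression before the keyword for plays the role of the expression before the colon, and the if clause plays the role of the condition after it.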


Exercise A10 


Which of the following statements are true? 

(a) 3e{rER:1>3} (b) TE {3kK4+1:k eZ} 

(c) —fe{xeZ:2<5} (da) 8€ {2 :xER, 0<2< 2} 

(e) $9 \in \{n \in \mathbb{Z} : n = k^2 \text{ for some } k \in \mathbb{Z}\}$ (f) $6 \in \{m(m - 1) : m \in \mathbb{N}\}$
(g) $4 \in \{r : r \text{ is an even integer},\ 0 < r < 4\}$


Notice that the next worked exercise contains lines of blue text, marked
with icons. You will see similar text in some of the worked
exercises and proofs throughout this module. This text tells you what 
someone doing the mathematics might be thinking, but would not write 
down; or what a lecturer might say to explain the thinking behind the 
mathematics, but would not write on the board. It should help you 
understand how you might approach a similar exercise yourself. 




Worked Exercise A3 


Use set notation to specify each of the following. 


(a) The set of all natural numbers greater than 50. 
(b) The set of all odd integers. 


The choice of the variables is arbitrary in these sets, but k for an integer 
and n for a natural number are conventional. 


Exercise A11 


Use set notation to specify each of the following. 

(a) The set of integers greater than $-2$ and less than 1000.
(b) The set of positive rational numbers with square greater than 2. 
(c) The set of even natural numbers. 

(d) The set of integer powers of 2.


Set notation is useful when we wish to refer to the set of solutions of one or 
more equations (called the solution set). For example, the real solutions 
of the equation $x^2 = 1$ form the set


$\{x \in \mathbb{R} : x^2 = 1\} = \{-1, 1\}$.


The solution set of an equation depends on the set of values from which 
the solutions are taken. For example, the solution set of the equation 


$(x - 1)(2x - 1) = 0$

is
$\{x \in \mathbb{R} : (x - 1)(2x - 1) = 0\} = \{1, \tfrac{1}{2}\}$

if we are interested in real solutions. However, the solution set is
$\{x \in \mathbb{Z} : (x - 1)(2x - 1) = 0\} = \{1\}$


if we are interested only in integer solutions. In this unit we assume that 
solutions are taken from R unless otherwise stated. 
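

The dependence of a solution set on the set of permitted values can be illustrated with a short Python sketch. This is an illustration under stated assumptions, not a general solver: the real line is replaced by a hypothetical finite set of candidates (the halves n/2 for integers n from -10 to 10), and exact fractions are used so that the test f(x) = 0 is not affected by rounding.

```python
from fractions import Fraction


def f(x):
    """Left-hand side of the equation (x - 1)(2x - 1) = 0."""
    return (x - 1) * (2 * x - 1)


# Assumed finite stand-in for the real line: -5, -9/2, ..., 9/2, 5.
candidates = {Fraction(n, 2) for n in range(-10, 11)}

# Solutions among all candidates, then only those that are integers.
real_solutions = {x for x in candidates if f(x) == 0}
integer_solutions = {x for x in real_solutions if x.denominator == 1}

print(sorted(real_solutions))
print(sorted(integer_solutions))
```

Restricting the candidates to integers removes the solution 1/2, in line with the two solution sets given above.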




Sometimes an equation has no real solutions, so its solution set has no 
elements. 


The set with no elements arises frequently in mathematics, so it is given a 
special name and notation. It is called the empty set and is denoted by 
the symbol $\varnothing$. Thus, for example,


$\{x \in \mathbb{R} : x^2 = -1\} = \varnothing$.


The symbol for the empty set, $\varnothing$, was introduced in 1939 by the
French mathematician André Weil (1906-1998), who took the symbol 
from the Norwegian alphabet. 




2.3 Intervals 


You saw in Subsection 1.1 that the set of real numbers R can be pictured 
as a number line, called the real line. Many sets involve ranges of real 
numbers extending along the real line from one number a to another 
number b. Each of the endpoints a and b may be either included or 
excluded. Such sets are called intervals of the real line, and they occur so 
frequently that we use special notation for them. For example: 


• the interval given by $-2 < x < 5$, in which both endpoints are excluded,
is denoted by $(-2, 5)$ and is an example of an open interval


• the interval given by $-2 \le x \le 5$, in which both endpoints are included,
is denoted by $[-2, 5]$ and is an example of a closed interval


• the intervals given by $-2 < x \le 5$ and $-2 \le x < 5$, in which one
endpoint is included and the other is excluded, are denoted by $(-2, 5]$
and $[-2, 5)$, respectively, and are examples of half-open (or half-closed)
intervals.


In some texts, a reversed square bracket is used instead of a round bracket
to indicate an excluded endpoint; for example $]-2, 5[$ is used instead of
$(-2, 5)$ for an open interval.


We use the symbol $\infty$ (infinity) when an interval extends indefinitely far to
the right on the real line, and the symbol $-\infty$ when an interval extends
indefinitely far to the left. For example:


• the set of all real numbers greater than $-3$ is denoted by $(-3, \infty)$
• the set of all real numbers less than or equal to 4 is denoted by $(-\infty, 4]$.


The symbol $\infty$ does not denote a real number: instead, it simply means
that the interval continues indefinitely. We always use round brackets with
$\infty$ and $-\infty$.


The notation for intervals is summarised in the box below. 




Interval notation
Intervals are denoted as follows.


Open intervals
$(a, b)$: $a < x < b$
$(a, \infty)$: $x > a$
$(-\infty, b)$: $x < b$
$(-\infty, \infty)$: $\mathbb{R}$


Closed intervals
$[a, b]$: $a \le x \le b$
$[a, \infty)$: $x \ge a$
$(-\infty, b]$: $x \le b$
$(-\infty, \infty)$: $\mathbb{R}$


Half-open (or half-closed) intervals
$[a, b)$: $a \le x < b$
$(a, b]$: $a < x \le b$


Remarks 


1. In the diagrams in the box above, a hollow dot ○ indicates that an endpoint is excluded,
and a solid dot ● indicates that an endpoint is included.


2. A singleton set {a}, containing a single number a, is a closed interval
whose endpoints are equal.


3. An interval such as $[a, \infty)$ is regarded as closed, rather than half-open
(or half-closed), because it contains all the real numbers greater than or
equal to a. However, the interval $\mathbb{R} = (-\infty, \infty)$ is considered to be both
open and closed.


4. We also use the notation (a,b) to denote a point in the plane, but in 
most cases it should be obvious whether a point or an interval is 
intended. 
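

The interval notation above can be modelled by a small data structure that records the two endpoints and whether each one is included. The Python sketch below is a hypothetical illustration: the class name Interval and its field names are assumptions, and math.inf is used to stand for $\infty$. An infinite end is never included, which mirrors the rule that round brackets are always used with $\infty$ and $-\infty$.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Interval:
    """An interval of the real line with endpoints lo and hi."""
    lo: float
    hi: float
    lo_closed: bool = False   # True for '[', False for '('
    hi_closed: bool = False   # True for ']', False for ')'

    def __contains__(self, x):
        above = x >= self.lo if self.lo_closed else x > self.lo
        below = x <= self.hi if self.hi_closed else x < self.hi
        return above and below


open_interval = Interval(-2, 5)                          # (-2, 5)
closed_interval = Interval(-2, 5, True, True)            # [-2, 5]
half_open = Interval(-2, 5, hi_closed=True)              # (-2, 5]
ray = Interval(-3, math.inf)                             # (-3, infinity)

print(-2 in open_interval, -2 in closed_interval)
print(5 in half_open, -2 in half_open)
print(10**9 in ray, math.inf in ray)
```

Defining __contains__ lets the Python keyword in play the role of the membership symbol $\in$.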


Exercise A12 


Which of the following statements are true? 
(a) $1 \in (1, 5)$ (b) $1 \in (-1, 1]$ (c) $\infty \in (0, \infty)$ (d) $0 \in \mathbb{R}^*$


(e) If $x \in \mathbb{R}^*$, then $x \in (0, \infty)$.


Exercise A13 


Use interval notation to specify the following intervals. 


SN om 
(a) 4 0 2 


(b) The set of real numbers x such that —6.5 < x < 21. 
(c) $\{a \in \mathbb{R} : a > -273\}$.




2.4 Plane sets 


In Subsection 1.1 you met the plane $\mathbb{R}^2$, and saw that each point in the
plane can be represented as an ordered pair (x,y) with respect to a chosen
pair of axes. A set of points in $\mathbb{R}^2$ is called a plane set or a plane figure.
The lines and circles that you met in Subsection 1.1 are simple examples of 
plane sets. 


Lines as plane sets


Consider a straight line $l_1$ with gradient m and y-intercept c, as illustrated
in Figure 16. This line is the set of all points (x,y) in the plane such that
$y = mx + c$. Using set notation, we write this as


$l_1 = \{(x, y) \in \mathbb{R}^2 : y = mx + c\}$.


(We often refer to ‘the line $y = mx + c$’ as a shorthand way of specifying
this set.)


Figure 16 The line $l_1$


For a line $l_2$ parallel to the y-axis with x-intercept a, as illustrated in
Figure 17, we write


$l_2 = \{(x, y) \in \mathbb{R}^2 : x = a\}$.


Figure 17 The vertical line $l_2$


An alternative way of specifying a line is to write an expression for one or
both of the coordinates. For example, an alternative way of specifying the
line $l_1$ with equation $y = mx + c$ is


$l_1 = \{(x, mx + c) : x \in \mathbb{R}\}$.


It does not matter what variable we use to specify the line. For example,
we can also write


$l_1 = \{(t, mt + c) : t \in \mathbb{R}\}$.


Exercise A14 
(a) Use set notation to specify the line l with gradient 2 that passes
through the point (0,5).
(b) Sketch the line $l = \{(x, y) \in \mathbb{R}^2 : y = 1 - x\}$.
(c) Sketch the line l = { (x, x£) : x € R}. 


Circles as plane sets 


Consider a circle C with centre (a,b) and radius r, as illustrated in
Figure 18. This circle is the set of all points (x,y) in the plane such that
$(x - a)^2 + (y - b)^2 = r^2$, so, in set notation, it can be written as


$C = \{(x, y) \in \mathbb{R}^2 : (x - a)^2 + (y - b)^2 = r^2\}$.


Figure 18 A circle


The unit circle U is defined to be the circle centred at the origin with
radius 1, so it is the set of points (x,y) in the plane whose distance from
the origin (0,0) is 1 (see Figure 19). In set notation, the unit circle can be
written as


$U = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 = 1\}$.


Figure 19 The unit circle U

Figure 20 The plane split into three parts by l

Figure 21 A point P in $H_1$

Figure 22 The plane split into three parts by the unit circle U


Exercise A15 


(a) Use set notation to specify the circle $C$ of radius 3 centred at $(1, -4)$.
(b) Sketch the circle $C = \{(x, y) \in \mathbb{R}^2 : (x - 1)^2 + (y - 3)^2 = 4\}$.


Half-planes, discs and other plane sets 


Consider the line

$l = \{(x, y) \in \mathbb{R}^2 : y = 1 - x\}$.


This line splits $\mathbb{R}^2$ into three separate parts, as shown in Figure 20: the
line $l$ itself, the set $H_1$ of points lying above the line, and the set $H_2$ of
points lying below the line.


For any point $P = (x, y)$ in $H_1$, the point $Q = (x, 1 - x)$ lies on the line $l$,
directly below $P$, as illustrated in Figure 21, so $y > 1 - x$. Similarly, each
point $(x, y)$ in $H_2$ satisfies $y < 1 - x$. Thus

$H_1 = \{(x, y) \in \mathbb{R}^2 : y > 1 - x\}$

and

$H_2 = \{(x, y) \in \mathbb{R}^2 : y < 1 - x\}$.


The set of points on one side of a line, possibly together with all the points
on the line itself, is known as a half-plane. A half-plane that does not
include the points on the line can be specified using set notation as in the
examples $H_1$ and $H_2$ above. The corresponding half-plane that includes
the points on the line can be specified by changing the symbol $>$ to $\geq$, or
the symbol $<$ to $\leq$.

When we sketch a plane set that excludes a boundary line, as for the set
$H_1$ in Figure 21, we draw the boundary as a broken line; if the plane set
includes a boundary line, then we draw the boundary as a solid line.

We can treat other plane sets in a similar way. For example, consider the
unit circle

$U = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 = 1\}$.


This circle splits $\mathbb{R}^2$ into three separate parts, as illustrated in Figure 22:
the circle $U$ itself, the set $D_1$ of points lying inside the circle and the
set $D_2$ of points lying outside the circle.


The condition for a point $(x, y)$ to lie inside $U$ is that the distance of the
point from the origin is less than 1. It follows that the square of the
distance of the point $(x, y)$ from the origin is also less than 1, so

$D_1 = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 1\}$.


Similarly,

$D_2 = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 > 1\}$.


The set of points inside a circle, possibly together with all the points on
the circle, is known as a disc. Figure 23 shows the disc $D_1$ with the broken
line indicating that the points on the circle are not included in the set.


If we wish to specify the disc consisting of the unit circle together with the
points inside it, we replace the inequality $<$ by $\leq$ in the set notation
specification of $D_1$ given above, and draw the boundary as a solid line.
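The way the unit circle splits the plane into three parts can be sketched in Python. This is a minimal illustration, assuming exact rational coordinates so that the boundary test is reliable; the function names classify and in_closed_disc are hypothetical.

```python
from fractions import Fraction


def classify(point):
    """Say which of the three parts of the plane a point lies in,
    relative to the unit circle x^2 + y^2 = 1."""
    x, y = point
    d2 = x * x + y * y          # square of the distance from the origin
    if d2 < 1:
        return "inside"         # the open disc
    if d2 == 1:
        return "on"             # the unit circle itself
    return "outside"


def in_closed_disc(point):
    """Membership test for the disc that includes its boundary (<= in place of <)."""
    x, y = point
    return x * x + y * y <= 1


print(classify((0, 0)))
print(classify((Fraction(3, 5), Fraction(4, 5))))
print(classify((1, 1)))
```

Exact fractions are used because a test such as d2 == 1 is unreliable with floating-point coordinates.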


As another example, consider the set of points lying inside the square with 
vertices (0,0), (1,0), (1,1) and (0,1), shown in Figure 24. This set can be 
written as 


$\{(x, y) \in \mathbb{R}^2 : 0 < x < 1,\ 0 < y < 1\}$.


The square boundary is excluded from this set, and we indicate this by 
drawing the boundary lines as broken lines and the vertices as hollow dots, 
as in Figure 24. 


If we wish our set to include the square boundary, we replace each symbol
$<$ by $\leq$, and we indicate this in a sketch by drawing the boundary lines as
solid lines and the four vertices as solid dots.


These conventions for drawing plane sets are consistent with those you met 
earlier for intervals. They are summarised below. 


Convention for drawing sets in $\mathbb{R}$ or $\mathbb{R}^2$
In a diagram of a subset of $\mathbb{R}$ or $\mathbb{R}^2$:

• included and excluded points are drawn as solid and hollow dots,
respectively

• included and excluded boundaries are drawn as solid and broken
lines, respectively.


Exercise A16 


Sketch each of the following plane sets. 
(a) $\{(x, y) \in \mathbb{R}^2 : x < 1\}$
(b) $\{(x, y) \in \mathbb{R}^2 : y \leq 2 - 2x\}$
(c) $\{(x, y) \in \mathbb{R}^2 : (x - 1)^2 + (y - 2)^2 < 4\}$
(d) $\{(x, y) \in \mathbb{R}^2 : x^2 + (y + 3)^2 > 1\}$


Exercise A17 


Use set notation to specify the set of points inside the square with vertices 
(0,1), (2,1), (2,3), (0,3), together with the boundary, and sketch this set. 


[Figure 23 The disc $D_1$]

[Figure 24 The points inside a square]

[Figure 25 A subset $A$ of a set $B$]


2.5 Set equality and subsets 


Consider the sets $A = \{1, -1\}$ and $B = \{x \in \mathbb{R} : x^2 - 1 = 0\}$. Although
these sets are written in different ways, each set contains exactly the same
elements, 1 and $-1$. We say that these sets are equal.


Definition 
Two sets A and B are equal if they have exactly the same elements; 
we write A= B. 


When two sets each contain a small number of elements, we can usually 
check whether these elements are the same, and hence decide whether the 
sets are equal. 
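For small sets, this element-by-element comparison can be sketched with Python's built-in set type. The sketch assumes that it is enough to search the integers from -10 to 10 for solutions of x^2 - 1 = 0, which works here only because both roots are integers; it is not a general method for sets of real numbers.

```python
# A listed set and a set described by a condition.
A = {1, -1}

# Assumption: searching the integers -10..10 finds every solution,
# which holds here because the roots of x^2 - 1 = 0 are 1 and -1.
B = {x for x in range(-10, 11) if x * x - 1 == 0}

# Two Python sets are equal exactly when they have the same elements;
# the order in which the elements are written is irrelevant.
print(A == B)
print({1, -1} == {-1, 1})
```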


Exercise A18 


Decide whether each of the following is a pair of equal sets. 
(a) $A = \{2, -3\}$ and $B = \{x \in \mathbb{R} : x^2 + x - 6 = 0\}$.


(b) $A = \{k \in \mathbb{Z} : k \text{ is odd and } 0 < k < 8\}$ and
$B = \{2n + 1 : n \in \mathbb{N} \text{ and } n^2 < 25\}$.


If two sets each contain more than a small number of elements, it is less 
easy to check whether they are equal. You will meet a method for dealing 
with cases like this shortly, but first we need the following idea. 


Consider the sets A = {7,2,5} and B = {2,3,5,7,11}. These sets are 
illustrated in the Venn diagram in Figure 25. Each element of A is also an 
element of B. We say that A is a subset of B. 


Definition 
A set $A$ is a subset of a set $B$ if each element of $A$ is also an element
of $B$. We also say that $A$ is contained in $B$, and we write $A \subseteq B$.


Do not confuse the symbol $\subseteq$ with the symbol $\in$. For example, we write
$\{1\} \subseteq \{1, 2, 3\}$ and $1 \in \{1, 2, 3\}$,
because $\{1\}$ is a subset of $\{1, 2, 3\}$ and 1 is an element of $\{1, 2, 3\}$.


We sometimes indicate that a set $A$ is a subset of a set $B$ by reversing the
symbol $\subseteq$ and writing $B \supseteq A$, which we read as '$B$ contains $A$'.


To indicate that $A$ is not a subset of $B$, we write $A \nsubseteq B$. We may also
write this as $B \nsupseteq A$, which we read as '$B$ does not contain $A$'.


The next box gives two simple but important facts about subsets. 


Subsets of every set 

For every set $B$:

• $B$ is a subset of itself, that is, $B \subseteq B$

• the empty set $\emptyset$ is a subset of $B$, that is, $\emptyset \subseteq B$.


The first result in the box follows immediately from the definition of a 
subset, given earlier. The second result in the box also follows from the 
definition, since any set B contains every element of the empty set, for the 
simple reason that the empty set has no elements. 


When we wish to determine whether a set A is a subset of a set B, the 
method we use depends on the way in which the two sets are defined. If A 
has a small number of elements, then we can check individually whether 
each element of A is an element of B. Otherwise, we determine 
algebraically whether an arbitrary element of A fulfils the membership 
criteria for B, as illustrated in Worked Exercise A4 below. 


To show that a set A is not a subset of a set B, we need to find at least 
one element of A that does not belong to B. 
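Both checks described above can be sketched in Python: testing each element of a small set individually, and exhibiting an element that fails the membership condition. The helper name counterexamples and the second pair of sets are illustrative assumptions.

```python
def counterexamples(A, in_B):
    """Return the elements of the finite set A that fail the membership
    test in_B; A is a subset of B exactly when this result is empty."""
    return {a for a in A if not in_B(a)}


# The sets from the Venn diagram example: each element of A is in B.
A = {7, 2, 5}
B = {2, 3, 5, 7, 11}
print(A <= B)                                  # <= is Python's subset test

# A hypothetical example where B is given by a condition (being odd):
# a single element that fails the condition shows A is not a subset of B.
witnesses = counterexamples({1, 4, 9}, lambda x: x % 2 == 1)
print(witnesses)
```

The empty result of counterexamples for the empty set reflects the fact that the empty set is a subset of every set.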


Worked Exercise A4 


In each of the following cases, determine whether $A \subseteq B$.
(a) A= {1,2,—4} and B = {z € R : zř + 4zî — z — 4 = 0}. 
(b) $A = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 1\}$ and $B = \{(x, y) \in \mathbb{R}^2 : x < 1\}$.




Exercise A19 


In each of the following cases, determine whether $A \subseteq B$.

(a) $A = \{(5, 2), (1, 1), (-3, 0)\}$ and $B = \{(x, y) \in \mathbb{R}^2 : x - 4y = -3\}$.
(b) $A = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 1\}$ and $B = \{(x, y) \in \mathbb{R}^2 : y < 0\}$.
(c) $A = [-1, 0]$ and $B = \{x \in \mathbb{R} : (x + 1)^2 \leq 1\}$.


If a set $A$ is a subset of a set $B$ that is not equal to $B$, then we say that $A$
is a proper subset of $B$, and we write $A \subset B$ or $B \supset A$.


In some texts, the symbol $\subset$ is used to mean 'is a subset of' (for which we
use the symbol $\subseteq$) rather than 'is a proper subset of'.


To show that a set A is a proper subset of a set B, we must show both 
that A is a subset of B, and that there is at least one element of B that is 
not an element of A. 


Worked Exercise A5 
Show that A is a proper subset of B, where: 
$A = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 1\}$ and $B = \{(x, y) \in \mathbb{R}^2 : x < 1\}$.
(A and B are the sets you met in Worked Exercise A4(b).) 


Exercise A20 


In each of the following cases show that A is a proper subset of B. 
(a) $A = \{(5, 2), (1, 1), (-3, 0)\}$ and $B = \{(x, y) \in \mathbb{R}^2 : x - 4y = -3\}$.
(b) $A = [-1, 0]$ and $B = \{x \in \mathbb{R} : (x + 1)^2 \leq 1\}$.


(These sets are the same as those in Exercise A19(a) and (c).) 


We now return to the question of how we can show that two sets A and B 
are equal if they have more than a small number of elements. 


If A is a subset of B, we have seen that A is either a proper subset of B or 
is equal to B. Similarly, if B is a subset of A, then B is either a proper 
subset of A or is equal to A. It follows that, if A is a subset of B and B is 
a subset of A, then the two sets A and B must be equal. This gives us our 
strategy. 


Strategy A1 

To show that the sets $A$ and $B$ are equal:
• first show that $A \subseteq B$

• then show that $B \subseteq A$.




Worked Exercise A6 


Show that the following sets are equal:
$A = \{(\cos t, \sin t) : t \in [0, 2\pi]\}$ and
$B = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 = 1\}$.


Solution


Comment: We could specify $A$ by
$A = \{(x, y) \in \mathbb{R}^2 : x = \cos t,\ y = \sin t \text{ for some } t \in [0, 2\pi]\}$.


First we show that $A \subseteq B$.
Let $(x, y)$ be an arbitrary element of $A$; then $(x, y)$ is a point in $\mathbb{R}^2$.
We have $x = \cos t$ and $y = \sin t$, for some $t \in [0, 2\pi]$. So
$x^2 + y^2 = \cos^2 t + \sin^2 t = 1$.
This implies that $(x, y) \in B$, so $A \subseteq B$.
Next we show that $B \subseteq A$.
Let $(x, y)$ be an arbitrary element of $B$; then
$x^2 + y^2 = 1$.
So $(x, y)$ lies on the unit circle.


Comment: To show that $(x, y)$ is an element of $A$, we need to find an angle
$t \in [0, 2\pi]$ such that $(x, y) = (\cos t, \sin t)$. A sketch will help.


If we take $t$ to be the (anticlockwise) angle from the (positive) $x$-axis
to the line joining the point $(x, y)$ with the origin, then $t \in [0, 2\pi]$, and

$x = \cos t$ and $y = \sin t$.

[Figure: the point $(x, y)$ on the unit circle, at angle $t$ from the positive $x$-axis, with vertical side $\sin t$]

It follows that $(x, y) \in A$, so $B \subseteq A$.
Since $A \subseteq B$ and $B \subseteq A$, it follows that $A = B$.
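The two inclusions in this argument can be spot-checked numerically in Python. This is only a sanity check on a few sample points, not a proof; the helper name angle_of and the chosen sample points are assumptions, and floating-point comparisons use a small tolerance.

```python
import math


def angle_of(x, y):
    """Anticlockwise angle in [0, 2*pi) from the positive x-axis
    to the line joining (x, y) with the origin."""
    t = math.atan2(y, x)                 # atan2 returns a value in [-pi, pi]
    return t if t >= 0 else t + 2 * math.pi


# A is a subset of B: points (cos t, sin t) satisfy x^2 + y^2 = 1.
ts = [0.0, 0.5, math.pi / 2, 2.0, math.pi, 4.0, 2 * math.pi]
forward_ok = all(
    math.isclose(math.cos(t) ** 2 + math.sin(t) ** 2, 1.0) for t in ts
)

# B is a subset of A: for sample points on the unit circle, the angle t
# recovers the point as (cos t, sin t).
points_B = [(1.0, 0.0), (0.0, 1.0), (-1.0, 0.0), (0.0, -1.0), (0.6, 0.8), (-0.6, -0.8)]
backward_ok = all(
    math.isclose(math.cos(angle_of(x, y)), x, abs_tol=1e-12)
    and math.isclose(math.sin(angle_of(x, y)), y, abs_tol=1e-12)
    for x, y in points_B
)

print(forward_ok, backward_ok)
```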




Exercise A21 


In each of the following cases, show that the sets A and B are equal. 
(a) $A = \{(t^2, 2t) : t \in \mathbb{R}\}$ and $B = \{(x, y) \in \mathbb{R}^2 : y^2 = 4x\}$.
(b) $A = \{(x, y) \in \mathbb{R}^2 : 2x + y - 3 = 0\}$ and $B = \{(t + 1, 1 - 2t) : t \in \mathbb{R}\}$.


2.6 Set operations 


Consider the two sets {2,3,5} and {1,2,5,8}. Using these sets, we can 
construct several new sets — for example: 


• the set {1, 2, 3, 5, 8} consisting of all elements belonging to at least one
of the two sets

• the set {2, 5} consisting of all elements belonging to both of the two sets

• the set {3} consisting of all elements belonging to the first set but not
the second, and the set {1, 8} consisting of all elements belonging to the
second set but not the first.


Each of these new sets is a particular instance of a general construction for 
sets. We now consider them in turn. 


Union 


You saw above that if A = {2,3,5} and B = {1,2,5,8}, then the set of all 
elements belonging to at least one of the sets A and B is {1,2,3,5,8}. We 
call this set the union of A and B. 


More generally, we have the following definition, which is illustrated by the 
Venn diagram in Figure 26. 


Definition

Let $A$ and $B$ be any two sets; then the union of $A$ and $B$ is the set

$A \cup B = \{x : x \in A \text{ or } x \in B\}$.

[Figure 26 The union of sets $A$ and $B$]


The word or in this definition is used in the inclusive sense of ‘and/or’; 
that is, the set $A \cup B$ consists of the elements of $A$ and the elements of $B$,
including the elements in both A and B. In everyday language, an example 
of ‘or’ used in the exclusive sense is ‘Tea or coffee?’, since the answer 
‘Both, please!’ is not expected. An example of ‘or’ used in the inclusive 
sense is ‘Milk or sugar?’, since in this case you could answer ‘Both’. 




Worked Exercise A7 


(a) Simplify $[-2, 4] \cup (0, 10)$.
(b) Sketch a diagram depicting the union of the half-plane $H$ and the disc
$D$, where
$H = \{(x, y) \in \mathbb{R}^2 : y \leq 2 - 2x\}$,
$D = \{(x, y) \in \mathbb{R}^2 : (x - 1)^2 + (y - 2)^2 < 4\}$.


Solution


(a) Comment: These intervals overlap.

[Figure: the intervals $[-2, 4]$ and $(0, 10)$ on a number line marked at $-2$, 0, 4 and 10]

We have $[-2, 4] \cup (0, 10) = [-2, 10)$.

(b) Comment: These are the half-plane and disc from Exercise A16(b)
and (c).

[Figure: the line $y = 2 - 2x$ and the circle $(x - 1)^2 + (y - 2)^2 = 4$]

Comment: The union consists of all the points in $H$ or $D$ or both; the two
points where the circle and line meet are both in the set $H$ and
so are both in the union $H \cup D$ and are shown as solid dots.

The set $H \cup D$ is as follows.

[Figure: the set $H \cup D$]


When sketching a set such as that in Worked Exercise A7(b), you should 
include enough detail so that the set is clear, and therefore the axes and an 
indication of scale are essential. Finding the exact points where the circle 
and line meet is not required, but can sometimes be helpful. In this case, 
substituting $y = 2 - 2x$ into the equation for the circle gives

$(x - 1)^2 + (-2x)^2 = 4$,

which simplifies to

$5x^2 - 2x - 3 = 0$.

This factorises as

$(x - 1)(5x + 3) = 0$,

which has solutions $x = 1$ and $x = -\frac{3}{5}$, so the circle and line meet at the
two points $(1, 0)$ and $(-\frac{3}{5}, \frac{16}{5})$.
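The calculation of the meeting points can be checked with exact arithmetic. The following Python sketch uses the standard fractions module; the function names on_circle and on_line are illustrative choices.

```python
from fractions import Fraction


def on_circle(x, y):
    """True if (x, y) lies on the circle (x - 1)^2 + (y - 2)^2 = 4."""
    return (x - 1) ** 2 + (y - 2) ** 2 == 4


def on_line(x, y):
    """True if (x, y) lies on the line y = 2 - 2x."""
    return y == 2 - 2 * x


# The two solutions of 5x^2 - 2x - 3 = 0 found by factorising.
roots = [Fraction(1), Fraction(-3, 5)]
quadratic_ok = all(5 * x ** 2 - 2 * x - 3 == 0 for x in roots)

# The corresponding points on the line, obtained from y = 2 - 2x.
points = [(x, 2 - 2 * x) for x in roots]
meet_ok = all(on_circle(x, y) and on_line(x, y) for x, y in points)

print(points, quadratic_ok, meet_ok)
```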


Exercise A22 


(a) Simplify $(1, 7) \cup [4, 11]$.
(b) Express the set $\mathbb{R}^*$ as a union of intervals.
(c) Sketch a diagram depicting the union of the half-plane $H$ and disc $D$,
where
$H = \{(x, y) \in \mathbb{R}^2 : y < 0\}$,
$D = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 4\}$.


So far you have seen the definition of the union of two sets. There is a
similar definition for the union of any number of sets; for example, the
union of three sets $A$, $B$ and $C$, as illustrated by the Venn diagram in
Figure 27, is the set

$A \cup B \cup C = \{x : x \in A \text{ or } x \in B \text{ or } x \in C\}$.

[Figure 27 The union of sets $A$, $B$ and $C$]


Intersection

You saw above that if A = {2,3,5} and B = {1,2,5,8}, then the set of all 


elements belonging to both set A and set B is {2,5}. We call this set the 
intersection of A and B. 


More generally, we have the following definition, which is illustrated by the
Venn diagram in Figure 28.


Definition

Let $A$ and $B$ be any two sets; then the intersection of $A$ and $B$ is
the set

$A \cap B = \{x : x \in A \text{ and } x \in B\}$.

[Figure 28 The intersection of sets $A$ and $B$]




Two sets with no element in common, such as $\{1, 3, 5\}$ and $\{2, 9\}$, are said
to be disjoint. We write this as $\{1, 3, 5\} \cap \{2, 9\} = \emptyset$ since this
intersection is empty.


Worked Exercise A8 


(a) Simplify $[-2, 4] \cap (0, 10)$.
(b) Sketch a diagram depicting the intersection of the half-plane $H$ and
disc $D$, where
$H = \{(x, y) \in \mathbb{R}^2 : y \leq 2 - 2x\}$,
$D = \{(x, y) \in \mathbb{R}^2 : (x - 1)^2 + (y - 2)^2 < 4\}$.


Solution


(a) Comment: The intersection is the overlap of these intervals.

[Figure: the intervals $[-2, 4]$ and $(0, 10)$ on a number line marked at $-2$, 0, 4 and 10]

We have $[-2, 4] \cap (0, 10) = (0, 4]$.


(b) Comment: These are the half-plane and disc from Exercise A16(b)
and (c), and Worked Exercise A7.

Comment: The intersection consists of all the points in both $H$ and $D$.
Neither of the points where the circle and the line meet is in the
set $D$, so these points are not in the intersection $H \cap D$, and
both are shown as hollow dots.

The set $H \cap D$ is as follows.

[Figure: the set $H \cap D$]


Exercise A23 


(a) Simplify $(1, 7) \cap [4, 11]$.


(b) Sketch a diagram depicting the intersection of the half-plane $H$ and
disc $D$, where

$H = \{(x, y) \in \mathbb{R}^2 : y < 0\}$,
$D = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 4\}$.


(These are the same sets as in Exercise A22(a) and (c).) 


So far you have seen the definition of the intersection of two sets. There is
a similar definition for the intersection of any number of sets; for example,
the intersection of three sets $A$, $B$ and $C$, as illustrated by the Venn
diagram in Figure 29, is the set

$A \cap B \cap C = \{x : x \in A \text{ and } x \in B \text{ and } x \in C\}$.

[Figure 29 The intersection of sets $A$, $B$ and $C$]


Difference

You saw above that if $A = \{2, 3, 5\}$ and $B = \{1, 2, 5, 8\}$, then the set of all
elements belonging to $A$ but not to $B$ is $\{3\}$; we call this set the
difference $A - B$. Similarly, the set of all elements belonging to $B$ but not
to $A$ is $\{1, 8\}$; this set is the difference $B - A$.


More generally, we have the following definition, which is illustrated by the 
Venn diagram in Figure 30. 


Definition

Let $A$ and $B$ be any two sets; then the difference between $A$ and $B$
is the set

$A - B = \{x : x \in A,\ x \notin B\}$.

[Figure 30 The difference between set $A$ and set $B$]


Notice that $A - B$ is different from $B - A$ when $A \neq B$. This is unlike the
union and intersection, where $A \cup B = B \cup A$ and $A \cap B = B \cap A$, for any
sets $A$ and $B$. Also, for any set $A$, we have $A - A = \emptyset$, again unlike the
union and intersection, where $A \cup A = A \cap A = A$.


In some texts the difference $A - B$ of two sets $A$ and $B$ is denoted by $A \setminus B$.
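For finite sets, the three operations correspond directly to operators on Python's built-in set type. The sketch below uses the sets A = {2, 3, 5} and B = {1, 2, 5, 8} from the text; the variable names are illustrative.

```python
A = {2, 3, 5}
B = {1, 2, 5, 8}

union = A | B            # elements in A or B (inclusive 'or')
intersection = A & B     # elements in both A and B
a_minus_b = A - B        # elements in A but not in B
b_minus_a = B - A        # elements in B but not in A

print(union, intersection, a_minus_b, b_minus_a)

# Union and intersection do not depend on the order of the two sets,
# whereas the difference does.
print(A | B == B | A, A & B == B & A, A - B == B - A)

# Disjoint sets have empty intersection.
print({1, 3, 5} & {2, 9} == set())
```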




Worked Exercise A9 


(a) Simplify $[-2, 4] - (0, 10)$ and $(0, 10) - [-2, 4]$.
(b) Sketch diagrams depicting the differences $H - D$ and $D - H$ of the
half-plane $H$ and disc $D$, where
$H = \{(x, y) \in \mathbb{R}^2 : y \leq 2 - 2x\}$,
$D = \{(x, y) \in \mathbb{R}^2 : (x - 1)^2 + (y - 2)^2 < 4\}$.


Solution

(a) [Figure: the differences of the intervals $[-2, 4]$ and $(0, 10)$ shown on number lines marked at $-2$, 0, 4 and 10]

We have

$[-2, 4] - (0, 10) = [-2, 0]$,
$(0, 10) - [-2, 4] = (4, 10)$.


(b) Comment: Again these are the half-plane and disc from Exercise A16(b)
and (c), and Worked Exercises A7 and A8.

Comment: Consider carefully the boundary points, and in particular, the
points where the line and circle meet. Both of the meeting points
are in $H - D$, as are the remaining points of the boundaries.
Neither of the meeting points is in the difference $D - H$, nor are
the remaining points of the boundaries.

The sets $H - D$ and $D - H$ are as follows.

[Figure: the sets $H - D$ and $D - H$]


Exercise A24 


(a) Simplify $(1, 7) - [4, 11]$ and $[4, 11] - (1, 7)$.
(b) Sketch diagrams depicting the differences $H - D$ and $D - H$ of the
half-plane $H$ and disc $D$, where
$H = \{(x, y) \in \mathbb{R}^2 : y < 0\}$,
$D = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 4\}$.


(These are the same sets as in Exercise A22(a) and (c), and Exercise A23.) 


3 Functions 


In this section you will revise what is meant by a function, and some
associated ideas. You will look at not only functions of real numbers, but
also functions of other mathematical objects. The idea of a function is
fundamental throughout this module, so it is vital that you have a good
understanding of this topic.

[Portrait: Gottfried Wilhelm Leibniz]


The term ‘function’ first emerged at the end of the seventeenth 
century in the correspondence of Gottfried Wilhelm Leibniz 
(1646-1716) and Johann Bernoulli (1667-1748). But it was Leonhard 
Euler (1707-1783) in the middle of the eighteenth century who was 
responsible for the essential development, notably through his 
Introductio in Analysin Infinitorum of 1748, the first work in which 
the concept of a function plays an explicit and central role. 


3.1 What is a function? 


You can think of a function as a machine for processing mathematical 
objects, such as numbers, points in the plane or vectors. 


[Portrait: Johann Bernoulli]


For example, consider the function $f$ that takes non-zero real numbers as
its inputs and whose rule is that the input $x$ leads to the output
$f(x) = 1/x$. You can regard it as a machine that calculates the reciprocals
of its input numbers. When 3 is fed into the machine, out comes $\frac{1}{3}$; when
$-2$ is fed into the machine, out comes $-\frac{1}{2}$; and so on. Any real number in
the domain $\mathbb{R}^*$ of $f$ can be processed by the machine to produce a real
number in the codomain $\mathbb{R}$ of $f$, as illustrated in Figure 31.

[Portrait: Leonhard Euler]

[Figure 31 A function as a machine, with domain $\mathbb{R}^*$ and codomain $\mathbb{R}$]

[Figure 33 A general function, with domain $A$ and codomain $B$]



Similarly, consider the function $g$ that accepts points in the plane as its
inputs and whose rule is that the input $(x, y)$ leads to the output
$g((x, y)) = y$. You can regard it as a machine that calculates the
$y$-coordinate of each input point. When the point $(1, 2)$ is fed into the
machine, out comes 2; when the point $(0, 0)$ is fed into the machine, out
comes 0; and so on. Any point in the domain $\mathbb{R}^2$ of $g$ can be processed by
the machine to produce a real number in the codomain $\mathbb{R}$ of $g$, as
illustrated in Figure 32.

[Figure 32 Another function as a machine, with domain $\mathbb{R}^2$ and codomain $\mathbb{R}$]

In general, imagine a machine that accepts an element $x$ from some set $A$,
and processes it to produce a single element $f(x)$ in some set $B$. This
machine corresponds to the following general definition of a function,
which is illustrated in Figure 33.


Definition

A function $f$ is defined by specifying:
• a set $A$, called the domain of $f$

• a set $B$, called the codomain of $f$

• a rule $x \longmapsto f(x)$ that associates each element $x \in A$ with a unique
element $f(x) \in B$.


The element $f(x)$ is the image of $x$ under $f$.


Symbolically, we write
$f : A \longrightarrow B$
$x \longmapsto f(x)$.


We often refer to a function as a mapping, and say that $f$ maps $A$ to $B$
and $x$ to $f(x)$.


Notice that the definition of a function does not require every element of
the codomain $B$ to be the image of an element of the domain $A$, but it
does require every element of the domain $A$ to have an image in the
codomain $B$. For example, a function with rule $x \longmapsto \sin x$ and domain $\mathbb{R}$
could have codomain $\mathbb{R}$, or $[-1, 1]$, or any set of real numbers of which
$[-1, 1]$ is a subset, but not, say, codomain $[0, 1]$ since the image of $3\pi/2$ is
$\sin(3\pi/2) = -1$, which is not in this set.


Notice also that the symbolic definition of a function given at the end of
the box above specifies all three of the constituent parts of a function at
once: the domain, the codomain and the rule. For example, the definition

$f : \mathbb{Z} \longrightarrow \mathbb{Z}$
$n \longmapsto n + 1$

specifies a function with domain $\mathbb{Z}$, codomain $\mathbb{Z}$ and rule $f(n) = n + 1$.

When we write a function symbolically, the first arrow is unbarred to
signify a mapping from the domain $A$ to the codomain $B$. The second
arrow is barred, to show that the particular element $x$ of $A$ is mapped to
the particular element $f(x)$ of $B$. Each arrow is read as 'maps to'.


The following paragraphs give a number of examples of different types of 
functions. 


Real functions 


A function whose domain and codomain are both subsets of R is called a 
real function. Examples include the functions 


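$f : \mathbb{R}^* \longrightarrow \mathbb{R}$, $x \longmapsto \frac{1}{x}$ and $g : \mathbb{R} \longrightarrow \mathbb{R}$, $x \longmapsto 2x - 5$.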


In some texts, a real function is defined to be a function whose codomain 
is a subset of R, but whose domain can be any set. 


You may be more familiar with seeing these functions written as simply 

f(x) =1/x and g(x) = 2x — 5. We write functions in this shortened way 
when it is understood from the context what the domain and codomain 

are. 


Distance function 
Functions of the form $f : \mathbb{R}^2 \longrightarrow \mathbb{R}$ can be used to specify quantities
associated with points in the plane. For example, the function

$f : \mathbb{R}^2 \longrightarrow \mathbb{R}$
$(x, y) \longmapsto \sqrt{x^2 + y^2}$

gives the distance of each point $(x, y)$ in the plane from the origin, as
shown in Figure 34.

[Figure 34 The distance of a point from the origin]


Transformations of the plane 
Functions that have a geometric interpretation are often called
transformations. Such functions include translations, reflections and
rotations of the plane. We now look at some simple examples. For each
one, the diagram shows the effect of the transformation on the square
whose vertices are at $(0, 0)$, $(1, 0)$, $(1, 1)$ and $(0, 1)$; part of the square is
shaded for clarity.

• The transformation
$f : \mathbb{R}^2 \longrightarrow \mathbb{R}^2$
$(x, y) \longmapsto (x + 2, y)$
is the translation of the plane that shifts (or translates) each point to
the right by 2 units, as illustrated in Figure 35.

[Figure 35 Translation 2 units to the right]

• The transformation
$f : \mathbb{R}^2 \longrightarrow \mathbb{R}^2$
$(x, y) \longmapsto (-x, y)$
is the reflection of the plane in the $y$-axis, as illustrated in Figure 36.

[Figure 36 Reflection in the $y$-axis]

• The transformation
$f : \mathbb{R}^2 \longrightarrow \mathbb{R}^2$
$(x, y) \longmapsto (-x, -y)$
is the rotation of the plane through an angle $\pi$ about the origin, as
illustrated in Figure 37.

[Figure 37 Rotation through an angle $\pi$ about the origin]


When specifying a function, like a transformation, where the elements of
the domain are of the form $(x, y)$, we simply write $f(x, y)$ rather than
$f((x, y))$.
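The three transformations described above can be written as ordinary Python functions on coordinate pairs and applied to the vertices of the unit square. The function names translate, reflect and rotate are illustrative choices.

```python
def translate(point):
    """Translation 2 units to the right: (x, y) maps to (x + 2, y)."""
    x, y = point
    return (x + 2, y)


def reflect(point):
    """Reflection in the y-axis: (x, y) maps to (-x, y)."""
    x, y = point
    return (-x, y)


def rotate(point):
    """Rotation through an angle pi about the origin: (x, y) maps to (-x, -y)."""
    x, y = point
    return (-x, -y)


# Vertices of the square used in the diagrams.
square = [(0, 0), (1, 0), (1, 1), (0, 1)]

for transformation in (translate, reflect, rotate):
    print(transformation.__name__, [transformation(p) for p in square])
```

Each function takes a single argument, the pair (x, y), which mirrors the point that the input of such a function is one element of the plane rather than two separate numbers.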


Exercise A25 


For each of the following functions $f : \mathbb{R}^2 \longrightarrow \mathbb{R}^2$, state whether $f$ is a
translation, reflection or rotation of the plane.

(a) $f(x, y) = (x + 2, y + 3)$

(b) $f(x, y) = (x, -y)$

(c) $f(x, y) = (-y, x)$


Functions whose domains are finite sets 


It is often useful to consider a function whose domain is a finite set. For
example, we can define a function whose domain and codomain are the set

$A = \{0, 1, 2, 3, 4, 5, 6, 7, 8, 9\}$

by

$f : A \longrightarrow A$
$x \longmapsto 9 - x$.

When the domain of a function $f$ has a small number of elements, we can
specify the rule of $f$ by listing the image $f(x)$ of each element $x$ in the
domain. For example, let $A = \{0, 1, 2, 3\}$ and $B = \{2, 3, 4, 5\}$; then we can
define a function $f : A \longrightarrow B$ by the rule

$f(0) = 2$, $f(1) = 2$, $f(2) = 4$, $f(3) = 5$.


We can represent the behaviour of this function by a diagram, as shown in 
Figure 38. A diagram of this type that represents a function always has 
the following features: 


• there is exactly one arrow from each element in the domain, since each
element in the domain has exactly one image in the codomain

• there may be no arrows, one arrow or several arrows going to an element
in the codomain, since an element in the codomain may not be an image
at all, may be an image of exactly one element in the domain, or may be
an image of several elements in the domain.
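The first feature gives a simple test for whether an arrow diagram represents a function. In the Python sketch below, a diagram is modelled as a list of (source, target) pairs; this representation and the name is_function are assumptions made for illustration.

```python
def is_function(arrows, domain, codomain):
    """Return True if the arrows (x, y) send each element of the domain
    to exactly one element of the codomain."""
    sources = [x for x, _ in arrows]
    arrows_are_valid = all(x in domain and y in codomain for x, y in arrows)
    one_arrow_each = all(sources.count(x) == 1 for x in domain)
    return arrows_are_valid and one_arrow_each


A = {0, 1, 2, 3}
B = {2, 3, 4, 5}

# The function f with f(0) = 2, f(1) = 2, f(2) = 4, f(3) = 5.
arrows = [(0, 2), (1, 2), (2, 4), (3, 5)]
print(is_function(arrows, A, B))

# Hypothetical diagrams that fail: no arrow from 3, and two arrows from 0.
print(is_function([(0, 2), (1, 2), (2, 4)], A, B))
print(is_function([(0, 2), (0, 3), (1, 2), (2, 4), (3, 5)], A, B))
```

Nothing is required of the targets beyond lying in the codomain, which matches the second feature: an element of the codomain may receive no arrows, one arrow or several.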


[Figure 38 Function $f$ from set $A$ to set $B$]


In the example shown in Figure 38, the number 3 is not an image at all, 5 
is the image of 3 only, and 2 is the image of both 0 and 1. 


Exercise A26 
Which of the following diagrams represent(s) a function? 
[Diagrams (a), (b) and (c)]


Identity functions 


Associated with any set $A$, there is a particularly simple function whose
domain and codomain are the set $A$. This is the identity function $i_A$,
which maps each element of $A$ to itself. (We sometimes omit the subscript
$A$ if we do not need to emphasise the set.)

For example, let $A = \{0, 1, 2, 3\}$; then the rule of the identity function $i_A$,
as illustrated in Figure 39, is

$i_A(0) = 0$, $i_A(1) = 1$, $i_A(2) = 2$, $i_A(3) = 3$.

[Figure 39 An identity function]

The following definition applies to any set $A$, finite or infinite.

Definition


The identity function on a set $A$ is the function
$i_A : A \longrightarrow A$
$x \longmapsto x$.


3.2 Image set of a function 


The rule associated with a function tells us how to find the image of any 
element in the domain. Often, however, we need to consider the images of 
all elements in some subset of the domain. The subset of the codomain 
containing these images is called the image of the original subset, as stated 
below and illustrated in Figure 40. 


Definition


Let $f : A \longrightarrow B$ be a function. For any subset $S$ of $A$, the image of $S$
under $f$, denoted by $f(S)$, is the set

$f(S) = \{f(x) : x \in S\}$.

[Figure 40 Image of a set $S$ under a function $f$]


Worked Exercise A10 


Find f(S), where S = {1,2,3} and 


f:R— R 
1 
LS =; 
Ey be 
Solution 


f(S) = {f@), f@), £(3)} = {1 $, 4}. 


Exercise A27 


Let 
f:R— R 
trou. 
Find the image under f of each of the following sets. 
(a) -S =10,1,2,2} b) Z 


The idea of the image of a subset of elements is useful in geometry, for
example, where we frequently want to consider the effect of a
transformation on a plane figure, a subset of $\mathbb{R}^2$. For example, suppose
that $S$ is the square with vertices at $(0, 0)$, $(1, 0)$, $(1, 1)$ and $(0, 1)$, and we
want to find the image of $S$ under the function

$f : \mathbb{R}^2 \longrightarrow \mathbb{R}^2$
$(x, y) \longmapsto (x + 2, y)$.

This function is the translation of the plane that moves each point $(x, y)$ to
the right by 2. The image of $S$ is therefore the square with vertices at
$f(0, 0) = (2, 0)$, $f(1, 0) = (3, 0)$, $f(1, 1) = (3, 1)$ and $f(0, 1) = (2, 1)$, as
shown in Figure 41.

[Figure 41 The image $f(S)$ of a square $S$ under a translation $f$]


Sometimes we want to consider the image of the whole domain of a
function: this set is called the image set of the function, as illustrated in
Figure 42.

[Figure 42 Image set of a function $f$]


Definition
The image set of a function $f : A \longrightarrow B$ is the set

$f(A) = \{f(x) : x \in A\}$.


The image set of a function is a subset of its codomain. It need not be 
equal to the codomain because there may be some elements of the 
codomain that are not images of elements in the domain. 


In some texts, the image set of a function is called the image of the 
function, or the range of the function. 


When the domain of a function f has a small number of elements, we can 
find the image set of f by finding the image of each element in the domain, 
and listing them to form a set. 
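For a function with a small finite domain, this listing can be sketched in Python by storing the rule as a dictionary. The example uses the function with f(0) = 2, f(1) = 2, f(2) = 4, f(3) = 5 from earlier in this section; the helper name image is an illustrative choice.

```python
# The rule of f, listed element by element: each key is an element of
# the domain and its value is the image of that element.
f = {0: 2, 1: 2, 2: 4, 3: 5}


def image(f, S):
    """The image f(S) = {f(x) : x in S} of a subset S of the domain."""
    return {f[x] for x in S}


# Image of a subset of the domain.
print(image(f, {0, 1}))

# Image set: the image of the whole domain. Repeated images appear once,
# because the result is a set.
image_set = image(f, f.keys())
print(image_set)
```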


Worked Exercise A11 


Let $A = \{-3, -2, -1, 0, 1, 2, 3\}$ and $B = \{0, 1, 2, 3, 4, 5, 6, 7, 8, 9\}$.
Find the image set of the function
$f : A \longrightarrow B$
$x \longmapsto x^2$.


Exercise A28 


Let $A = \{0, 1, 2, 3, 4, 5, 6, 7, 8, 9\}$.
Find the image set of the function
$f : A \longrightarrow A$
$x \longmapsto 9 - x$.


You should have found that for the particular function in Exercise A28 the
image set and the codomain are the same set. In other words, each
element of the codomain is the image of an element in the domain, as
illustrated in Figure 43. A function with this property is said to be onto.

[Figure 43 An onto function: $f(A) = B$]


Definition
A function $f : A \longrightarrow B$ is onto if $f(A) = B$.


Some texts refer to an onto function as a surjective function.
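For a function with a finite domain, the definition of onto translates into a direct comparison of two sets. The Python sketch below stores a rule as a dictionary and takes the codomain as a separate argument, since the rule alone does not determine the codomain; the name is_onto and the second example are assumptions made for illustration.

```python
def is_onto(rule, codomain):
    """A function is onto if its image set equals its codomain."""
    image_set = set(rule.values())
    return image_set == set(codomain)


# The function with f(0) = 2, f(1) = 2, f(2) = 4, f(3) = 5 and
# codomain {2, 3, 4, 5} is not onto, because 3 is not an image.
f = {0: 2, 1: 2, 2: 4, 3: 5}
print(is_onto(f, {2, 3, 4, 5}))

# The same rule with the smaller codomain {2, 4, 5} is onto.
print(is_onto(f, {2, 4, 5}))

# A hypothetical onto function in which one image is hit twice.
h = {0: "a", 1: "b", 2: "a"}
print(is_onto(h, {"a", "b"}))
```

The first two calls use the same rule and domain, so whether a function is onto depends on the choice of codomain as well as on the rule.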


Exercise A29 
Which of the following diagrams represent(s) an onto function? 


[Diagrams (a), (b) and (c)]


You have seen that if the domain of a function is a small finite set, then we 
can find the image set of the function by finding the image of each element 
of the domain individually. If the domain is a large finite set or an infinite 
set, then we need an algebraic argument to determine the image set. 
Sometimes we ‘guess’ what the image set seems to be, and then confirm 
this algebraically. 


For a real function, a sketch of its graph can help us ‘guess’ the image set. 
For a function that is a transformation of the plane, we can use our 
knowledge of such transformations to help us ‘guess’ the image set. 


To show that the image set is equal to our ‘guess’ set, we use our usual 
strategy for showing that two sets are equal: we show that each is a subset 
of the other. 


• To show that the image set is a subset of our 'guess' set, we show that
the image of an arbitrary element of the domain lies in our 'guess' set.

• To show that our 'guess' set is a subset of the image set, we take an
arbitrary element of our 'guess' set and find an element of the domain
whose image is this arbitrary element.


Worked Exercise A12 


For each of the following functions, find its image set and determine 
whether it is onto. 
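(a) $f : \mathbb{R} \longrightarrow \mathbb{R}$, $x \longmapsto 2x - 5$
(b) $f : \mathbb{R} \longrightarrow \mathbb{R}$, $x \longmapsto x^2$
(c) $f : \mathbb{R}^2 \longrightarrow \mathbb{R}^2$, $(x, y) \longmapsto (x + 1, y + 2)$
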


Solution 
(a) A sketch of the graph of f is shown below. 


®. For every element on the y-axis, a horizontal line drawn 
through that element meets the graph. So it seems that every 
element of the codomain is the image of some element of the 
domain. That is, we ‘guess’ that the image set f(IR) is the whole 
codomain R. & 


We prove that f(R) =R. 


®. The image set is always a subset of the codomain; in this case 
the codomain is R, so f(R) CR. @ 


We know that f(R) C R, so we must show that f(R) 2 R. 


@®. We take an arbitrary element in our ‘guess’ set R, and find an 
element in the domain R whose image is this arbitrary 
element. © 


Let y be an arbitrary element in R. We must show that y ∈ f(R);
that is, there exists an element x in the domain R such that


f(x) = y; that is, 2x − 5 = y.


Rearranging this equation, we obtain


x = (y + 5)/2,

which is in the domain R. So we have

f(x) = 2x − 5
= 2((y + 5)/2) − 5
= y;

that is, for every y ∈ R there is an x in the domain R such that
f(x) = y.


Thus f(R) ⊇ R.


Since f(R) ⊆ R and f(R) ⊇ R, it follows that f(R) = R, so the
image set of f is indeed R.


The codomain of f is also R, so f is onto. 




(b) A sketch of the graph of f is shown below. 




For every element in the interval [0, ∞) of the y-axis (marked
on the sketch), a horizontal line drawn through that element
meets the graph. For any element outside this interval, such a
horizontal line does not meet the graph. So we ‘guess’ that the
image set f(R) is [0, ∞).

We prove that f(R) = [0, ∞).

We know that the image set is a subset of the codomain R,
but we don’t know that it is a subset of [0, ∞). We have to show
algebraically that f(R) ⊆ [0, ∞) by finding the image of an
arbitrary element in the domain R.

Let x be an arbitrary element in the domain R; then f(x) = x².
Now, x² ≥ 0 for all x ∈ R, so f(R) ⊆ [0, ∞).

We must now show that f(R) ⊇ [0, ∞).

We take an arbitrary element in our ‘guess’ set [0, ∞), and
find an element of the domain R whose image is this arbitrary
element.


Let y be an arbitrary element in [0, ∞). We must show that
there exists an element x in the domain R such that


f(x) = y; that is, x² = y.

Now x = √y is in R (since y ≥ 0) and satisfies f(x) = y, as
required. Thus f(R) ⊇ [0, ∞).

Since f(R) ⊆ [0, ∞) and f(R) ⊇ [0, ∞), it follows that
f(R) = [0, ∞), so the image set of f is [0, ∞).

The image set f(R) = [0, ∞) is not the whole of the codomain R,
so f is not onto.


If we had simply been asked to determine whether f is onto,
we could have shown that it is not by finding just one element,
say −1, in the codomain R that is not the image of an element of
the domain R.


(c) This function is a translation of the plane (it shifts each point
to the right by 1 unit and up by 2 units). So we expect (‘guess’)
the image set to be the plane R².


We prove that f(R²) = R².




Exercise A30 


For each of the following functions, find its image set and determine 
whether it is onto. 
(a) f : R → R, x ↦ 1 + x²
(b) f : R² → R², (x, y) ↦ (x, −y)


As you have seen from Worked Exercise A12 and Exercise A30, when you 
want to determine whether a function is onto, it is crucial to take into 
account what the codomain of the function is. For example, you saw in 
Worked Exercise A12 that the function 


f : R → R
x ↦ x²


is not onto. To see this, you just have to observe that the element —1, for 
example, of the codomain is not the image of any element of the domain. 
However, if you remove all the negative numbers from the codomain of this 
function, then you obtain the new function 


g : R → [0, ∞)
x ↦ x²,


which is onto, since every element of the codomain is an image of an 
element of the domain. Note that these functions f and g are different 
functions, since they have different codomains. 


3.3 Inverse functions 


Given a function 
f : A → B
x ↦ f(x),
it is sometimes possible to define an inverse function that ‘undoes’ the 
effect of f by mapping each image element f(x) back to the element x 
whose image it is. For example, a rotation in the plane can be ‘undone’ by 
a rotation in the opposite direction. 
However, consider the function 
f : A → B
x ↦ x²,
where A = {−3, −2, −1, 0, 1, 2, 3} and B = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9}.
We know that f(−2) = f(2) = 4, and so a function that ‘undoes’ the effect
of f must map the number 4 to the number −2 and to the number 2,
which is impossible. Thus, in this case, no inverse function exists. This 
function f is an example of a function that is many-to-one. A many-to-one 
function does not have an inverse function. 


Definitions 


A function f : A → B is one-to-one if each element of f(A) is the
image of exactly one element of A; that is,


if x₁, x₂ ∈ A and f(x₁) = f(x₂), then x₁ = x₂.


A function that is not one-to-one is many-to-one. 


Thus a function f is one-to-one if it maps distinct elements in the domain 
A to distinct elements in the image set f(A). Some texts refer to a 
one-to-one function as an injective function. 


To prove that a function f is not one-to-one (that is, that the function is 
many-to-one), it is sufficient to find just one pair of distinct elements in 
the domain A with the same image under f. 
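For a finite domain, the search for such a pair can be automated. The sketch below is a minimal illustration, assuming a hypothetical helper `find_collision`; it uses the set A = {−3, ..., 3} from the example above and compares the squaring rule with the rule x ↦ 2x − 5.

```python
from itertools import combinations


def find_collision(f, domain):
    """Return a pair of distinct elements with the same image, or None.

    A returned pair shows that f is many-to-one on this finite domain;
    None means that f is one-to-one on it.
    """
    for x1, x2 in combinations(sorted(domain), 2):
        if f(x1) == f(x2):
            return (x1, x2)
    return None


def square(x):
    return x * x


def shift(x):
    return 2 * x - 5


A = {-3, -2, -1, 0, 1, 2, 3}

collision_square = find_collision(square, A)  # some pair such as x and -x
collision_shift = find_collision(shift, A)    # no pair exists for this rule
```

A single pair returned for `square` is enough to show that it is many-to-one, whereas `None` for `shift` only confirms one-to-oneness on the finite sample A, not on the whole of R.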


Exercise A31 
Which of the following diagrams represent(s) a one-to-one function? 


[Diagrams (a), (b) and (c) are not reproduced here.]




If the domain of a function is a large finite set or an infinite set, then to 
show that the function is one-to-one, we need an algebraic argument. We 
aim to show algebraically that, if two elements of the domain have the 
same image under the function, then they must actually be the same 
element, as demonstrated in Worked Exercise A13. 


Showing that a function is not one-to-one is more straightforward: we just 
give a pair of distinct elements that have the same image under the 
function, as you have seen. 


For a real function, an initial sketch of its graph can help us ‘guess’ 
whether or not the function is one-to-one, and if it is not one-to-one, the 
graph can also help us find a pair of elements that show this. 


Worked Exercise A13 


Determine which of the following functions are one-to-one. 
(a) f : R → R, x ↦ 2x − 5
(b) f : R → R, x ↦ x²
(c) f : R² → R², (x, y) ↦ (x + 1, y + 2)


(These are the same functions as in Worked Exercise A12.) 


Solution 
(a) A sketch of the graph of f is shown below. 


Each horizontal line meets the graph just once. So it seems
that no element of the codomain is the image of more than one
element of the domain. That is, it seems that f is one-to-one. To
prove this, we show that if two elements x₁ and x₂ in the domain
have the same image, then they must actually be the same
element.


We show that f is one-to-one. Suppose that f(x₁) = f(x₂); then
2x₁ − 5 = 2x₂ − 5,
so 2x₁ = 2x₂, and hence x₁ = x₂.


Thus f is one-to-one. 


(b) A sketch of the graph of f is shown below. 




Some horizontal lines meet the graph more than once. So it
seems that f is not one-to-one. To show this, we find two distinct
elements of the domain with the same image.


This function is not one-to-one since, for example,
f(2) = f(−2) = 4.


(c) This function is a translation of the plane, so we expect it to
be one-to-one.


We show that f is one-to-one. Suppose that f(x₁, y₁) = f(x₂, y₂);
then


(x₁ + 1, y₁ + 2) = (x₂ + 1, y₂ + 2).
Thus

x₁ + 1 = x₂ + 1 and y₁ + 2 = y₂ + 2,
so

x₁ = x₂ and y₁ = y₂.


Hence (x₁, y₁) = (x₂, y₂), so f is one-to-one.


Exercise A32 


Determine which of the following functions is one-to-one. 
(a) f : R → R, x ↦ 1 + x²
(b) f : R² → R², (x, y) ↦ (x, −y)


(These are the same functions as in Exercise A30.) 


For a one-to-one function f : A → B, we have the situation illustrated in
Figure 44. Each element y in f(A) is the image of a unique element x in
A, and so we can reverse the arrows to obtain the inverse function with
domain f(A) and image set A, which maps y back to x. When it exists, we
denote the inverse function of f by f⁻¹.


Figure 44 A function and its inverse


Definition 
Let f : A → B be a one-to-one function. Then f has an
inverse function f⁻¹ : f(A) → A, with rule


f⁻¹(y) = x, where y = f(x).


Notice in this definition that the domain of f⁻¹ is f(A); it is not
necessarily the whole of B.

However, if a function f : A → B is onto, as well as one-to-one, then f
has an inverse function f⁻¹ : B → A; that is, the domain of f⁻¹ is the
whole of B.

A function f : A → B that is both one-to-one and onto is said to be a
one-to-one correspondence between the sets A and B. For such a
function f, not only is f⁻¹ the inverse of f, but also f is the inverse of
f⁻¹; that is, the functions f and f⁻¹ are inverses of each other.


Some texts refer to a one-to-one correspondence as a bijection. 
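For a function with a finite domain, 'reversing the arrows' can be done literally. The following sketch is an illustration under stated assumptions: the helper `inverse_of` and the finite sample domain `A` are hypothetical, and the rule x ↦ 3x² − 1 is used only as a convenient example that is one-to-one on non-negative inputs.

```python
def inverse_of(f, domain):
    """Build the inverse of f as a dict mapping each image f(x) back to x.

    The keys of the result form the image set f(domain), which is the
    domain of the inverse. Raises ValueError if f is not one-to-one.
    """
    inverse = {}
    for x in domain:
        y = f(x)
        if y in inverse:
            raise ValueError(
                f"not one-to-one: {inverse[y]} and {x} both map to {y}"
            )
        inverse[y] = x
    return inverse


def f(x):
    return 3 * x * x - 1


A = [0, 1, 2, 3]          # finite sample of non-negative inputs
f_inv = inverse_of(f, A)  # maps each image back to its unique preimage
```

The dictionary `f_inv` is defined only on the image set of the sample, mirroring the fact that the domain of f⁻¹ is f(A) and not necessarily the whole codomain.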


Worked Exercise A14 


For each of the following functions, determine whether f has an inverse
function f⁻¹; if it exists, find it.
(a) f : R → R, x ↦ 2x − 5
(b) f : R → R, x ↦ x²
(c) f : R² → R², (x, y) ↦ (x + 1, y + 2)
(d) f : [0, ∞) → [−1, ∞), x ↦ 3x² − 1


It does not matter whether the definition of f⁻¹ is expressed
in terms of x or y, but it is more usual to use x in the definition
of a real function.


This definition can be expressed in terms of x as
f⁻¹ : R → R
x ↦ (x + 5)/2.


(b) In Worked Exercise A13(b), we showed that f is not one-to-one,
so f does not have an inverse function.


(c) In Worked Exercise A13(c), we showed that f is one-to-one, so f
has an inverse function.


In Worked Exercise A12(c), we showed that the image set of f is
R² and that, for each (x′, y′) in the image set R², we have


(x′, y′) = f(x, y) = f(x′ − 1, y′ − 2).
Under f, we know that (x′, y′) is the image of (x′ − 1, y′ − 2),
so under the inverse, (x′ − 1, y′ − 2) is the image of (x′, y′).
So f⁻¹ is the function

f⁻¹ : R² → R²
(x′, y′) ↦ (x′ − 1, y′ − 2).

This definition can be expressed in terms of x and y as

f⁻¹ : R² → R²
(x, y) ↦ (x − 1, y − 2).

This makes sense: geometrically, f is the translation that
shifts each point to the right by 1 unit and up by 2 units, so we
expect the inverse to be a translation to the left by 1 unit and
down by 2 units.


(d) A sketch of the graph of f is shown below.




Each horizontal line meets the graph just once. So it seems
that f is one-to-one. To prove this, we show that if two elements
x₁ and x₂ in the domain have the same image, then they must
actually be the same element.


We show that f is one-to-one. Suppose that f(x₁) = f(x₂); then
3x₁² − 1 = 3x₂² − 1,

so 3x₁² = 3x₂², and hence x₁² = x₂². Since both x₁ and x₂ are in the
domain [0, ∞), this implies that x₁ = x₂.

Thus f is one-to-one.


We now find the image set of f. From the sketch, we ‘guess’
that it is [−1, ∞), the codomain of f. That is, we guess that f is
onto.

We prove that f([0, ∞)) = [−1, ∞).

The image set is a subset of the codomain.

We know that f([0, ∞)) ⊆ [−1, ∞), so we must show that

f([0, ∞)) ⊇ [−1, ∞).

We take an arbitrary element in our ‘guess’ set [−1, ∞), and
find an element of the domain [0, ∞) whose image is this
arbitrary element.


Let y be an arbitrary element in [−1, ∞). We must show that
there exists an element x in the domain [0, ∞) such that

f(x) = y; that is, 3x² − 1 = y.
Rearranging this equation, we obtain

x² = (y + 1)/3.

Since y ∈ [−1, ∞), we know y + 1 ≥ 0, so


x = √((y + 1)/3)


is in the domain [0, ∞). So we have
f(x) = 3x² − 1
= 3((y + 1)/3) − 1
= y;
that is, for every y ∈ [−1, ∞) there is an x ∈ [0, ∞) such that


f(x) = y.


Exercise A33 


For each of the following functions, determine whether f has an inverse
function f⁻¹ and, if it exists, find it.

(a) f : R → R, x ↦ 1 + x²
(b) f : R² → R², (x, y) ↦ (x, −y)
(c) f : R → R, x ↦ 8x + 3


(For parts (a) and (b), use your answers from Exercises A30 and A32.) 


Restrictions 


When we are working with a function f : A → B, it is sometimes
convenient to restrict attention to the behaviour of f on some subset C of
A. For example, consider the function
f : R → R
x ↦ x².
This function is not one-to-one and so does not have an inverse function.
However, if the domain of f is replaced by the set C = [0, ∞), then we
obtain a related function,
g : C → R
x ↦ x²,
shown in Figure 45. The rule is the same as for f, but the domain is
‘restricted’ to produce a new function g that is one-to-one and so has an
inverse.


The function g is an example of a restriction of f in the sense that 
g(x) = f(x) for all x in the domain of g. 


Figure 45 The function g with domain [0, ∞)


More generally, we define a restriction, illustrated in Figure 46, as follows. 


Definition 
Let f : A → B and let C be a subset of the domain A. Then the
function g : C → B defined by


g(x) = f(x), for x ∈ C,


is the restriction of f to C. 
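The effect of restricting a domain can be seen on a finite sample. The sketch below is illustrative only; the helpers `restrict` and `is_one_to_one` and the sample sets are assumptions made for this example. It restricts the squaring rule to the non-negative members of a small set of integers.

```python
def restrict(f, subset):
    """Return the restriction of f to subset, as a dict x -> f(x)."""
    return {x: f(x) for x in subset}


def is_one_to_one(mapping):
    """A finite mapping is one-to-one when no image value is repeated."""
    values = list(mapping.values())
    return len(values) == len(set(values))


def f(x):
    return x * x


A = list(range(-3, 4))        # finite stand-in for the domain of f
C = [x for x in A if x >= 0]  # the subset to which we restrict

f_on_A = restrict(f, A)
g = restrict(f, C)            # same rule as f, smaller domain
```

On this sample, `f_on_A` repeats image values (for instance 1 and −1 both map to 1), whereas `g` does not, which mirrors the way the restriction to [0, ∞) makes the squaring function one-to-one.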


Figure 46 The function g is 
the restriction of f to C 


Exercise A34 


Let f be the function 
f : R → [−1, 1]
x ↦ sin x.


Write down a restriction of f that is one-to-one. 


3.4 Composite functions 


In Subsection 3.1, you saw how a function may be regarded as a machine 
that processes elements in the domain to produce elements in the 
codomain. Now suppose that two such machines are linked together, so 
that the elements emerging from the first machine are fed into the second 
machine for further processing. The overall effect is to create a new 
‘composite’ machine that corresponds to a so-called composite function. 


For example, consider the real functions


f : R → R, x ↦ x² and g : R → R, x ↦ 2x − 5.

When the machines for f and g are linked together so that elements are
first processed by f and then by g, we obtain the ‘composite’ machine
illustrated by the large box in Figure 47.


Figure 47 The composite function g ∘ f as a machine


For instance, when 2 is fed into the machine, it is first squared by f to
produce the number 4, and then 4 is processed by g to give the number
(2 × 4) − 5 = 3.


Similarly, when an arbitrary real number x is fed into the machine, it is
first processed by f to give the real number x². Since x² lies in R, the
domain of g, the number x² can then be processed by g to give 2x² − 5.
Thus, overall, the composite machine corresponds to a function, which we
denote by g ∘ f, whose rule is


(g ∘ f)(x) = g(f(x)) = g(x²) = 2x² − 5.


In general, we have the following definition. 


Definition 
Let f : A → B and g : B → C be two functions such that the
domain of g is the same set as the codomain, B, of f. Then the
composite function g ∘ f is given by
g ∘ f : A → C
x ↦ g(f(x)).


Notice that g ∘ f means f first, then g.
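The 'f first, then g' convention can be checked with a short Python sketch. The helper name `compose` is an assumption introduced for this illustration; the rules for f and g are the squaring and x ↦ 2x − 5 functions from the machine example above.

```python
def compose(g, f):
    """Return the composite g o f, which applies f first and then g."""
    return lambda x: g(f(x))


def f(x):
    return x * x


def g(x):
    return 2 * x - 5


g_of_f = compose(g, f)  # x -> 2x^2 - 5
f_of_g = compose(f, g)  # x -> (2x - 5)^2

value_g_of_f = g_of_f(2)  # f gives 4, then g gives 2*4 - 5
value_f_of_g = f_of_g(2)  # g gives -1, then f gives (-1)^2
```

Feeding in 2 gives 3 for g ∘ f but 1 for f ∘ g, so the order of composition matters.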


Exercise A35 


Let f and g be the functions 


f : R → R, x ↦ −x and g : R → R, x ↦ 3x + 1.


Determine the composite functions


(a) g ∘ f, (b) f ∘ g.


In general, the composite functions g ∘ f and f ∘ g are not equal, as you
saw in Exercise A35.


Composite functions have many uses in mathematics; for example, we can 
use them to examine the effect of one transformation of the plane followed 
by another. 


Suppose, for instance, that f and g are the reflections of the plane in the
x-axis and y-axis respectively:
f : R² → R², (x, y) ↦ (x, −y) and g : R² → R², (x, y) ↦ (−x, y).
The composite function g ∘ f describes the overall effect of first reflecting
in the x-axis (changing the sign of y) and then reflecting in the y-axis
(changing the sign of x), as shown in Figure 48. The rule of g ∘ f is


(g ∘ f)(x, y) = g(f(x, y)) = g(x, −y)
= (−x, −y).


Figure 48 The composite g ∘ f

Thus g ∘ f is the function
g ∘ f : R² → R²
(x, y) ↦ (−x, −y),
which rotates the plane through an angle π about the origin, as can be
seen by considering Figure 49, which shows how a square is transformed by
g ∘ f.


Figure 49 The composite function g ∘ f transforming a square


Exercise A36 


Determine the composite function f ∘ g, where f and g are the reflections
of the plane in the x-axis and y-axis respectively, as defined above.


So far, we have considered the composite function g ∘ f only when the
domain of the function g is the same as the codomain of the function f.
We can, however, form the composite function g ∘ f when g and f are any
two functions.


For example, consider the functions 


f : R → R, x ↦ x² and g : R − {1} → R, x ↦ 1/(x − 1).

Recall that R − {1} is the set of all real numbers with 1 excluded.


Here the domain of g is not equal to the codomain of f, but we can still
consider the composite function g ∘ f, with the rule


(g ∘ f)(x) = g(f(x)) = g(x²) = 1/(x² − 1).


However, we have to be careful about the domain of g ∘ f. It cannot be the
whole of R, the domain of f. To see this, consider what happens when we
try to feed the number 1 into the ‘machine’ corresponding to g ∘ f, as
shown in Figure 50.


Figure 50 An input number that cannot be ‘processed’ by g ∘ f


If we try to feed the number 1 into the machine, then it can be processed
by f to produce the number 1, but 1 cannot then be processed by g, since
it is not in the domain of g. We have the same problem if we try to feed
the number −1 into the machine. However, if we feed any other number in
the domain of f into the machine, then it can be processed by f and then
g to produce a final output number. So we take the domain of g ∘ f to be
R − {1, −1}. Thus the composite function g ∘ f is

g ∘ f : R − {1, −1} → R
x ↦ 1/(x² − 1).

In general, if f and g are any two functions, then we take the domain of
the composite function g ∘ f to consist of all the elements x in the domain
of f such that f(x) is in the domain of g. The codomain of g ∘ f is always
the same as the codomain of g. So we have the following definition.


Definition 
Let f : A → B and g : C → D be any two functions; then the
composite function g ∘ f has:


• domain {x ∈ A : f(x) ∈ C}
• codomain D
• rule (g ∘ f)(x) = g(f(x)).


This definition allows us to consider the composite of any two functions,
though in some cases the domain may turn out to be the empty set ∅.
However, some texts insist on f(A) ⊆ C as a condition to ensure g ∘ f
exists.
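The domain {x ∈ A : f(x) ∈ C} can be computed directly when A is replaced by a finite sample. The sketch below is a hedged illustration: the finite range standing in for A and the helper `in_domain_of_g` are assumptions made for this example, while the rules are those of the example above, f(x) = x² and g(t) = 1/(t − 1) with 1 excluded from the domain of g.

```python
from fractions import Fraction

A = range(-3, 4)  # finite sample standing in for the domain of f


def f(x):
    return x * x


def in_domain_of_g(t):
    """g is defined for every real number except 1."""
    return t != 1


def g(t):
    return Fraction(1, t - 1)


# Domain of g o f: those x in A whose image f(x) lies in the domain of g.
domain_g_of_f = [x for x in A if in_domain_of_g(f(x))]

# The rule of g o f, tabulated on that domain with exact fractions.
g_of_f = {x: g(f(x)) for x in domain_g_of_f}
```

On this sample the inputs 1 and −1 are excluded, in agreement with the domain R − {1, −1} found above.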




In the example above with
g ∘ f : R − {1, −1} → R
x ↦ 1/(x² − 1),
the domain of g ∘ f is just the set of values for which the rule of g ∘ f is
defined. This is not always the case, as illustrated in the following worked
exercise where the domain of f is not the whole of R.


Worked Exercise A15 


Determine the composite function g ∘ f for the following functions f and g:


f : [0, 2π) → [−1, 1], x ↦ sin x and g : R − {−1} → R*, x ↦ 1/(x + 1).


Solution 
The composite function g ∘ f means f then g.
The rule of g ∘ f is


(g ∘ f)(x) = g(f(x)) = g(sin x) = 1/(sin x + 1).


A number x is in the domain of g ∘ f if it is in the domain of f and
f(x) is in the domain of g.


The domain of g ∘ f is
{x ∈ [0, 2π) : f(x) ∈ R − {−1}}.
If x ∈ [0, 2π), then f(x) ∈ R − {−1} unless f(x) = −1.


Now f(x) = −1 means sin x = −1, and the only value of x in [0, 2π)
such that sin x = −1 is
x = 3π/2.


The domain is complicated to write down so it helps to give it a
name, say D.


So the domain of g ∘ f is
D = [0, 2π) − {3π/2}.
Thus g ∘ f is the function
g ∘ f : D → R*
x ↦ 1/(sin x + 1).


Notice that, as claimed, in the worked exercise above the domain of g ∘ f is
not the full set of values for which g ∘ f is defined. The full set of values for
which g ∘ f is defined is


{x ∈ R : sin x ≠ −1} = R − {(2n − ½)π : n ∈ Z}.


Exercise A37 
Determine the composite function g ∘ f for the following functions f and g:


f:[-11J—-R ee a 
rte 3x + 1 


eS ‘ 
xr+2 


Using function composition to show that a function is 
the inverse of another function 


Suppose that f : A → B is a one-to-one and onto function. Then f has
an inverse function f⁻¹ : B → A. We can therefore consider the effect
that the composite function f⁻¹ ∘ f : A → A has on an arbitrary element
x in A. First, f maps x to an element y = f(x) in B. Then f⁻¹ ‘undoes’
the effect of f and maps y back to x, as illustrated in Figure 51. Overall,
the effect of f⁻¹ ∘ f is to leave x unchanged, or fixed: that is,
(f⁻¹ ∘ f)(x) = x. Since x is an arbitrary element of A, it follows that
f⁻¹ ∘ f fixes all the elements of A. In other words, f⁻¹ ∘ f = i_A, the
identity function on set A.


Figure 51 The composite function f⁻¹ ∘ f


A similar argument can be used to show that f ∘ f⁻¹ = i_B. So, if
f : A → B has an inverse function f⁻¹ : B → A, then


f⁻¹ ∘ f = i_A and f ∘ f⁻¹ = i_B.


The converse of this statement is also true: that is, if a function
g : B → A satisfies


g ∘ f = i_A and f ∘ g = i_B,


then g is the inverse function of f. A proof of this is given after
Exercise A39. It leads to the following strategy.




Strategy A2 


To show that the function g : B → A is the inverse function of the
function f : A → B:


1. show that g(f(x)) = x for each x ∈ A; that is, g ∘ f = i_A
2. show that f(g(y)) = y for each y ∈ B; that is, f ∘ g = i_B.


In practice, we can sometimes use Strategy A2 as an alternative way of 
finding an inverse function. We make an inspired guess at the inverse 
function, and use Strategy A2 to check that our guess is correct. 
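Before writing out the algebra for both steps, a guessed inverse can be spot-checked numerically. The sketch below is only a sanity check on sample points, not a proof; it assumes the illustrative pair f(x) = 2x − 5 and g(y) = (y + 5)/2, taken from Worked Exercise A12(a), and uses exact rational arithmetic to avoid rounding error.

```python
from fractions import Fraction


def f(x):
    return 2 * x - 5


def g(y):
    # Candidate inverse, obtained by solving y = 2x - 5 for x.
    return (y + 5) / 2


# Sample points: the rationals -3, -8/3, ..., 3 (a finite stand-in for R).
samples = [Fraction(n, 3) for n in range(-9, 10)]

# Step 1 of the strategy: g(f(x)) = x for each sample x.
step1_holds = all(g(f(x)) == x for x in samples)

# Step 2 of the strategy: f(g(y)) = y for each sample y.
step2_holds = all(f(g(y)) == y for y in samples)
```

If either check failed, the guess would be wrong; passing both on samples merely suggests that the algebraic verification is worth carrying out.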


Worked Exercise A16 


Use Strategy A2 to find the inverse of the function 
f:R—R 


Exercise A38 
Use Strategy A2 to show that g is the inverse of f, where 


f : R → R, x ↦ 5x − 3 and g : R → R, x ↦ (x + 3)/5.




Exercise A39 


Use Strategy A2 to find the inverse of the function 
f:R—-R 


To end this section, here is the promised proof that if the functions 
f : A → B and g : B → A satisfy
g ∘ f = i_A and f ∘ g = i_B,


then g is the inverse function of f. That is, we prove that if the two steps
of Strategy A2 hold, then f has an inverse function, and the inverse
function is equal to g.


Suppose, then, that the two steps of Strategy A2 hold. First we show that
f is one-to-one.


Suppose that f(x₁) = f(x₂); then


g(f(x₁)) = g(f(x₂)),


so, since g(f(x)) = x for each x ∈ A by the first step of Strategy A2, we
have x₁ = x₂. Thus f is one-to-one and so it has an inverse function f⁻¹.


Now we find the image set of f. 


We know that the image set of f is a subset of its codomain B, so
f(A) ⊆ B. We now show that f(A) ⊇ B by showing that every element y
of B is the image under f of some element in A. Suppose that y ∈ B.
Then, by the second step of Strategy A2,


f(g(y)) = y;


that is, y is the image under f of the element g(y) and g(y) ∈ A, as
required. Therefore f(A) ⊇ B.


Since f(A) ⊆ B and f(A) ⊇ B, it follows that the image set of f is B (that
is, f is onto), and so f⁻¹ has domain B.


We now know that each of the functions f⁻¹ and g has domain B and
codomain A. To show that they are equal, it remains to show that
g(y) = f⁻¹(y) for each element y of B.


Let y be an arbitrary element of B. Then y = f(x) for some element x of
A. So


f⁻¹(y) = x,
and, by the first step of Strategy A2,
g(y) = g(f(x)) = x.


Hence f⁻¹ and g are indeed equal functions.


Figure 52 The same vector represented in different ways

4 Vectors 


In this section you will revise vectors, in both the plane R² and in
three-dimensional space R³. Vectors are used throughout Book C Linear
algebra.


4.1 What is a vector? 


A mathematical or physical quantity that has a direction as well as a size 
is called a vector, or a vector quantity. An example of such a quantity 
is velocity: to state the velocity of a car you have to give its speed and also 
the direction in which it is moving. In contrast, some mathematical and 
physical quantities, such as temperature and volume, have only a size — 
they have no direction associated with them. We call such quantities 
scalars, or scalar quantities. When discussing vectors and scalars, we 
usually use the term magnitude, rather than size. 


Definition 


A vector is a quantity that is determined by its magnitude and 
direction. A scalar is a quantity that is determined by its magnitude. 


We can represent a vector in R² or in R³ geometrically by a line segment
with an arrowhead, as illustrated in Figure 52. The length of the line 
segment is a measure of the magnitude of the vector, and the direction of 
the arrowhead indicates the direction. The starting point of the line 
segment does not matter; for example, all the line segments with 
arrowheads in Figure 52 represent the same vector. We can draw the 
arrowhead at the end of the line segment, or in the middle of it, as 
convenient. A vector represented by a line segment from A to B, with an 
arrowhead pointing from A to B, can be written as AB. 


Often we use single letters, such as a, b, p, q or v, to denote vectors. 
Vectors are usually distinguished in print by the use of a bold typeface, 
and in handwritten work by underlining the letters (for example, v). These 
are important conventions as they clearly distinguish vector quantities 
from scalar quantities. 


We denote the magnitude of a vector v by the notation |v|.


There is one vector that does not fit conveniently into the definition above; 
namely, the zero vector. It represents any vector quantity that has 
magnitude zero and hence has no direction, such as the velocity of a 
stationary car. 


Definition 
The zero vector is the vector whose magnitude is zero, and whose 
direction is undefined. It is denoted by the symbol 0. 


The next box defines what it means to say that two vectors are equal. 


Definition 

Two vectors a and b are equal if:

• they have the same magnitude; that is, |a| = |b|
• they are in the same direction.


We write a = b.


For example, in Figure 53, the vector v is equal to the vector d, but is not 
equal to any of the other vectors, as they all differ from v in magnitude or 
direction. 




Figure 53 A selection of vectors in the plane 


We now briefly revise some other definitions relating to vectors. 


Definition 
The negative of a vector v is the vector that has the same magnitude 
as v, but the opposite direction. It is denoted by —v. 


For example, in Figure 53 we have bee If we write v as AB for 
suitable points A and B, then —v = BA, as shown in Figure 54. 


Scalar multiple of a vector 


Let k be a scalar and v a vector. The scalar multiple kv of v is the
vector:


• whose magnitude is |k| times the magnitude of v; that is,
|kv| = |k| |v|

• that has the same direction as v if k > 0, and the opposite direction
if k < 0.


If k = 0, then kv = 0.


Figure 54 The vectors v and −v


For example, in Figure 53 we have c = 2v, since c has the same direction
as v but twice the magnitude, and e = −3f, since e has the opposite
direction to f and its magnitude is 3 times that of f.


Exercise A40 


For each of the vectors shown below, decide whether it is a multiple of any 
of the other vectors; if it is, write down an equation of the form v₁ = kv₂
that specifies the relationship between them.




Exercise A41 


For the vector d in Exercise A40, sketch 3d and —2d. 


We can add two vectors using either of the two laws below. They give the 
same result, as illustrated in Figure 55. 


Triangle Law for addition of vectors 

The sum p + q of two vectors p and q is obtained as follows. 

1. Starting at any point, draw the vector p. 

2. Starting from the tip of the vector p, draw the vector q. 

Then the sum p + q is the vector from the tail of p to the tip of q. 


Parallelogram Law for addition of vectors 
The sum p + q of two vectors p and q is obtained as follows. 
1. Starting at the same point, draw the vectors p and q. 


2. Complete the parallelogram of which these vectors are adjacent 
sides. 


Then the sum p + q is the vector from the point where the tails of p 
and q meet to the opposite corner of the parallelogram. 


Figure 55 The sum p + q obtained by (a) the Triangle Law (b) the
Parallelogram Law


Addition and scalar multiplication of vectors obey the usual rules of 
algebra. The most important of these are listed in the box below. 


Properties of vector algebra 


Let p, q and r be vectors, and let a, b ∈ R. The following properties
hold.


Commutativity p + q = q + p
Associativity (p + q) + r = p + (q + r)
Distributivity a(p + q) = ap + aq,
(a + b)p = ap + bp.


Finally, we define subtraction of vectors in terms of addition and the 
negative of a vector, as follows, and as illustrated in Figure 56. 


Definition 
The difference p − q of the vectors p and q is


p − q = p + (−q).


Figure 56 The difference p − q of vectors p and q


Since the vector −q has the same magnitude as q but the opposite
direction, we can draw p − q by using either of the two constructions that
we use for adding vectors.


In general, q − p does not equal p − q; in fact, as you would expect,


q − p = −(p − q).




Exercise A42 


For the vectors p and q shown below, sketch p+ q, p — q and 2p + $4. 




4.2 Components and the arithmetic of 
vectors 


We can sometimes simplify the manipulation of vectors by expressing them 
in component form. To do this, we start by defining the following unit 
vectors, shown in Figure 57. A unit vector is a vector of magnitude 1. 


In R², the vectors i and j are the unit vectors in the positive
directions of the x- and y-axes, respectively.


In R³, the vectors i, j and k are the unit vectors in the positive
directions of the x-, y- and z-axes, respectively.


Figure 57 The unit vectors i, j and k


Any vector in R² can be expressed as the sum of scalar multiples of i and
j, and similarly any vector in R³ can be expressed as the sum of scalar
multiples of i, j and k. For example, the vector v in Figure 58(a) can be
expressed as


v = 3i + 4j,
and the vector w in Figure 58(b) can be expressed as
w = 2i + 4j + 3k.


These expressions are the component forms of v and w.


Figure 58 (a) A vector v in R² (b) A vector w in R³


In general we have the following. 


Definitions 
Any vector p in R² can be expressed in component form as
p = a₁i + a₂j, for some real numbers a₁, a₂;


we often write p = (a₁, a₂), for brevity. The numbers a₁ and a₂ are
the components of p in the x- and y-directions, respectively.


Any vector p in R³ can be expressed in component form as
p = a₁i + a₂j + a₃k, for some real numbers a₁, a₂, a₃;


we often write p = (a₁, a₂, a₃), for brevity. The numbers a₁, a₂ and a₃
are the components of p in the x-, y- and z-directions, respectively.


So, for example, the component form of the vector v in Figure 58(a) is
3i + 4j, or, equivalently, (3, 4).

Similarly, the component form of the vector w in Figure 58(b) is
2i + 4j + 3k, or, equivalently, (2, 4, 3).


In some texts, the ordered pairs and ordered triples that represent the 
component forms of vectors are written vertically, as 


2 


for example, to distinguish them from points. Although we write them 
horizontally in this module, the meaning of an ordered pair or ordered 
triple should be clear from the context. 


Exercise A43 


Sketch the following vectors in R² on a single diagram:


2i − 3j, −3i + 4j, −2i − 2j.




In the box below, the operations on vectors that were described 
geometrically in Subsection 4.1 are expressed in terms of components. The 
component forms of the vectors are expressed as ordered pairs and ordered 
triples in the box; there are analogous formulas for vectors expressed in 
terms of the unit vectors i, j and k. For example, the zero vector in R² can
be written as 0 = 0i + 0j rather than as 0 = (0, 0).


Vector arithmetic in component form 

Equality Two vectors, both in R² or both in R³, are equal if their 
corresponding components are equal. 

Zero vector The zero vector is 
0 = (0, 0) in R², 
0 = (0, 0, 0) in R³. 

Addition To add vectors in R² or in R³, add their corresponding 
components: 
(a1, a2) + (b1, b2) = (a1 + b1, a2 + b2), 
(a1, a2, a3) + (b1, b2, b3) = (a1 + b1, a2 + b2, a3 + b3). 

Negatives To find the negative of a vector in R² or in R³, take the 
negatives of its components: 
−(a1, a2) = (−a1, −a2), 
−(a1, a2, a3) = (−a1, −a2, −a3). 

Subtraction To subtract vectors in R² or in R³, subtract the 
corresponding components: 
(a1, a2) − (b1, b2) = (a1 − b1, a2 − b2), 
(a1, a2, a3) − (b1, b2, b3) = (a1 − b1, a2 − b2, a3 − b3). 

Scalar multiplication To multiply a vector in R² or in R³ by a 
real number k, multiply each component by k: 
k(a1, a2) = (ka1, ka2), 
k(a1, a2, a3) = (ka1, ka2, ka3). 

Magnitude The magnitude of the vector (a1, a2) in R² is 
√(a1² + a2²). 
The magnitude of the vector (a1, a2, a3) in R³ is 
√(a1² + a2² + a3²). 


The formulas for magnitude are derived from the distance formulas for R² 
and R³ that you met in Section 1. 




Here are some examples of vector arithmetic in component form, in R²: 
the sum of two vectors, 

(1, −3) + (4, 2) = (1 + 4, −3 + 2) = (5, −1), 

the negative of a vector, 

−(1, −3) = (−1, 3), 

and a scalar multiple of a vector, 

2(2, −1) = (4, −2). 

The magnitude of the vector (1, −3) is given by 

√(1² + (−3)²) = √(1 + 9) = √10. 
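The component rules above translate directly into code. The following is a minimal sketch in Python, assuming that a vector is stored as a tuple of numbers; the function names are illustrative choices and are not part of the module.

```python
import math

# Assumption: a vector in R^2 or R^3 is a tuple of numbers, and the two
# arguments of add/subtract have the same number of components.

def add(p, q):
    """Add two vectors by adding corresponding components."""
    return tuple(a + b for a, b in zip(p, q))

def negate(p):
    """Negative of a vector: negate each component."""
    return tuple(-a for a in p)

def subtract(p, q):
    """Subtract q from p by subtracting corresponding components."""
    return tuple(a - b for a, b in zip(p, q))

def scale(k, p):
    """Scalar multiple kp: multiply each component by k."""
    return tuple(k * a for a in p)

def magnitude(p):
    """Square root of the sum of the squared components."""
    return math.sqrt(sum(a * a for a in p))

print(add((1, -3), (4, 2)))    # (5, -1)
print(negate((1, -3)))         # (-1, 3)
print(scale(2, (2, -1)))       # (4, -2)
print(magnitude((1, -3)))      # the square root of 10, as a float
```

The same functions work unchanged for triples, since each rule acts component by component.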


Exercise A44 


For each of the following pairs of vectors p and q, write down p + q, −q 
and p − q. 


(a) p = (3, −1) and q = (−1, −2). 
(b) p = −i − 2j and q = 2i − j. 
(c) p = −i + 2k and q = i − 2j − k. 


Exercise A45 


For each of the following pairs of vectors p and q, determine 2p, 3q and 
2p − 3q, and find the magnitude of q. 


(a) p = (3, −1) and q = (−1, −2). 
(b) p = −i + 2k and q = i − 2j − k. 


Unit vectors 


As you saw earlier, a unit vector is a vector of magnitude 1. We denote 
the unit vector that is in the same direction as a particular vector v by v̂ 
(read as ‘v hat’), as illustrated in Figure 59. 
To find v̂, we multiply v by the reciprocal of its magnitude, as follows. 

Figure 59 A vector v and its corresponding unit vector v̂ 


The unit vector in the same direction as a vector v is 
v̂ = (1/|v|) v. 


The exception to this notation for unit vectors is that we use the special 
symbols i, j and k for the unit vectors in the positive directions of the x-, 
y- and z-axes, as you saw earlier. This is common practice, though some 
texts use the alternative symbols x̂, ŷ and ẑ for these vectors. 
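As an illustration of the rule for finding a unit vector, here is a minimal Python sketch. It assumes vectors are tuples of numbers; the function name unit_vector is a hypothetical choice. The zero vector is rejected, since it has no direction and its magnitude has no reciprocal.

```python
import math

def unit_vector(v):
    """Return the unit vector in the same direction as v."""
    size = math.sqrt(sum(a * a for a in v))
    if size == 0:
        # The zero vector has no direction, so no unit vector exists.
        raise ValueError("the zero vector has no unit vector")
    # Multiply v by the reciprocal of its magnitude.
    return tuple(a / size for a in v)

print(unit_vector((3, 4)))  # (0.6, 0.8)
```

Here the magnitude of (3, 4) is 5, so each component is divided by 5.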


Figure 60 The position vector of the point (2, −1) 


Worked Exercise A17 


Find v̂ for v = (3, 4). 


Exercise A46 


Find v̂ for each of the following vectors v. 
(a) (2, −3) (b) 5i + 12j 


Position vectors 


There is a natural and useful way to associate every point in the plane or 
in three-dimensional space with a vector. We make the following definition. 


Definition 

Let P be any point in R² or R³. The position vector of P is the 
vector whose starting point is the origin and whose finishing point 
is P, that is, the vector OP, where O is the origin. 


For example, the position vector of the point P(2, −1) is the vector 
OP = 2i − j (often written as (2, −1)), as shown in Figure 60. 


In general, any point (x, y) in R² has position vector xi + yj (often written 
as (x, y)), and similarly any point (x, y, z) in R³ has position vector 
xi + yj + zk (often written as (x, y, z)). 


Exercise A47 


Let p and q be the position vectors of the points (5,3) and (1,4), 
respectively. 


(a) Determine the vectors p − q, p + q and ½p + ½q. 


(b) Sketch p, q and each of the vectors that you found in part (a), 
starting each vector at the origin. 


The following simple result about position vectors is often useful. 


Let A and B be points (in R² or R³), with position vectors a and b, 
respectively. Then 

AB = b − a. 


To see this, let O be the origin, as shown in Figure 61. Then 
AB = AO + OB (by the Triangle Law for vector addition) 
   = −OA + OB 
   = −a + b 
   = b − a, 

as claimed. 
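The result can be checked numerically. The sketch below, in Python, uses two arbitrarily chosen points A(5, 3) and B(1, 4) (an assumption made only for illustration) and compares b − a with the route via the origin, AO + OB.

```python
def displacement(a, b):
    """Vector from the point with position vector a to the point
    with position vector b, computed as b - a."""
    return tuple(bi - ai for ai, bi in zip(a, b))

a = (5, 3)   # position vector of A (illustrative choice)
b = (1, 4)   # position vector of B (illustrative choice)

ab = displacement(a, b)

# Triangle Law check: AB = AO + OB, where AO = -a and OB = b.
ao = tuple(-ai for ai in a)
via_origin = tuple(x + y for x, y in zip(ao, b))

print(ab)          # (-4, 1)
print(via_origin)  # (-4, 1)
```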


The sets R² and R³ 


Finally, we clarify some issues about the sets R² and R³. You have seen 
that we use the notation R² to denote the plane, and the notation R³ to 
denote three-dimensional space. Strictly, the meaning of these notations is 
as follows: 

R² = {(x, y) : x, y ∈ R}, 

R³ = {(x, y, z) : x, y, z ∈ R}. 

That is, R² is the set of all ordered pairs of real numbers, and R³ is the set 
of all ordered triples of real numbers. We interpret these sets as the plane 
and as three-dimensional space, respectively, by interpreting their elements 
as the coordinates of points with respect to particular coordinate systems, 
in the way that you have seen. 


However, it is often useful to instead interpret the elements of R² and R³ 
as vectors. For example, we can interpret the element (2, −1) of R² not as 
the point with coordinates (2, −1), but instead as the vector with 
component form (2, −1). 


We can use whichever interpretation of R² and R³ is more useful in a 
particular context. A link between the two interpretations is provided by 
position vectors, because the vector with component form (x, y) is the 
position vector of the point with coordinates (x, y), and similarly the 
vector with component form (x, y, z) is the position vector of the point 
with coordinates (x, y, z). 


This link also makes it straightforward to represent a particular point not 
by coordinates, but by a vector: we use its position vector. It might seem 
that this amounts to much the same thing, but the advantage of 
representing points by vectors is that it enables us to use the properties of 
vectors to work with points. This leads to some very convenient ways of 
working with points, as you will see in the next subsection and again in 
Subsection 4.5. In Book C you will see how generalising all these ideas 
leads to some interesting and very useful mathematics. 


Figure 61 Points A and B and their position vectors 


Figure 62 A point R on the line l 


Figure 63 The position of R determined by λ 


4.3 Vector form of the equation of a line 


In Subsection 1.1, we found that every line in the plane has an equation of 
the form 


ax + by = c, 


where a, b and c are real numbers, with a and b not both zero. In this 
subsection we find an equivalent general form for the equation of a line in 
terms of vectors. Unlike the equation above, this vector form applies to 
lines in R³ as well as in R², as you will see later in this subsection. 


Let P and Q be points with position vectors p and q, respectively, and 
let l be the line that passes through P and Q, as illustrated in Figure 62. 
We now find an expression for the position vector r of an arbitrary point R 
on l in terms of the position vectors p and q. 


Since the vector PR is parallel to the vector PQ, it must be a multiple of 
PQ, that is, 

PR = λPQ, 

for some real number λ. Now, by the result about position vectors given at 
the end of the last subsection, we have 

PR = r − p and PQ = q − p. 

So 
r − p = λ(q − p). 
We can rearrange this equation as 
r = p + λ(q − p), 
that is, 
r = (1 − λ)p + λq. (3) 


This is a general formula for the position vector of a point on the line 
through P and Q, in the following sense: each point on l corresponds to a 
particular value of λ, and vice versa. As shown in Figure 63, we have the 
following. 

• If λ = 0, then r = 1p + 0q = p. 
• If λ = 1, then r = 0p + 1q = q. 
• If λ > 1, then R lies on l beyond Q. 
• If 0 < λ < 1, then R lies on l between P and Q. 
• If λ < 0, then R lies on l beyond P. 


So we can regard equation (3) as the vector form of the equation of the 
line l. 


Vector form of the equation of a line 


The equation of the line through the points with position vectors p 
and q is 
r = (1 − λ)p + λq, where λ ∈ R. 


Note in particular that when λ = ½ in the equation above, we have 
r = ½p + ½q = ½(p + q), which is the position vector of the midpoint of 
the line segment PQ. 
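The vector form of the equation of a line can be evaluated directly for any value of the parameter. The following Python sketch assumes vectors are tuples and uses exact fractions for the midpoint; the function name point_on_line is a hypothetical choice, and the points p = (1, 3) and q = (−1, −2) are those of Worked Exercise A18.

```python
from fractions import Fraction

def point_on_line(p, q, lam):
    """Position vector r = (1 - lam) p + lam q of the point on the line
    through p and q that corresponds to the parameter value lam."""
    return tuple((1 - lam) * a + lam * b for a, b in zip(p, q))

p, q = (1, 3), (-1, -2)

print(point_on_line(p, q, 0))   # (1, 3), the point P
print(point_on_line(p, q, 1))   # (-1, -2), the point Q

# lam = 1/2 gives the midpoint of the line segment PQ.
midpoint = point_on_line(p, q, Fraction(1, 2))
print(midpoint == (0, Fraction(1, 2)))  # True
```

Values of lam greater than 1 or less than 0 give points beyond Q or beyond P, respectively.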


Worked Exercise A18 


(a) Let P and Q be the points with position vectors p = (1,3) and 
q = (—1, —2), respectively. Find the vector form of the equation of 
the line l through P and Q. 


(b) Determine whether the point (3, 8) lies on l. 


Exercise A48 


Let P and Q be the points with position vectors p = (3,1) and q = (2,3), 
respectively. Let l be the line through P and Q. 


(a) Find the vector form of the equation of the line l. 


(b) Determine the three points on l whose position vectors are given by 
the equation you found in part (a) when λ takes the values 4, 2 and 
−5, respectively. 

(c) On a single diagram, sketch P, Q, the line l through P and Q, and 
the three points that you found in part (b). 




Exercise A49 


Let P, Q and l be as in Exercise A48. 


(a) Determine the value of λ corresponding to the point (4, −1) in the 
vector form of the equation of l. 


(b) Use the vector form of the equation of l to prove that the point (4, 5) 
does not lie on l. 


In the vector form of the equation of a line, there is no assumption that p 
and q are position vectors of points in R²: they may equally well be 
position vectors in R³. 
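Because the vector form makes no reference to the dimension, one routine can test whether a point lies on a line in either R² or R³. The sketch below is one possible approach in Python, not a prescribed method: it solves for the parameter in each component and checks that the values agree. It assumes integer or Fraction coordinates and that the two given points are distinct; the function name parameter_of is hypothetical.

```python
from fractions import Fraction

def parameter_of(p, q, x):
    """Return lam such that x = (1 - lam) p + lam q, or None if the point
    with position vector x is not on the line through p and q.
    Assumes p != q and integer (or Fraction) coordinates."""
    lam = None
    for a, b, c in zip(p, q, x):
        if b != a:
            # This component gives c = a + lam (b - a).
            candidate = Fraction(c - a, b - a)
            if lam is None:
                lam = candidate
            elif candidate != lam:
                return None      # components disagree about lam
        elif c != a:
            return None          # component is constant on the line
    return lam

# A line in R^2 and a line in R^3, using the same function.
print(parameter_of((1, 3), (-1, -2), (3, 8)))            # -1
print(parameter_of((1, 2, 3), (3, -2, 1), (4, -4, 0)))   # 3/2
print(parameter_of((1, 2, 3), (3, -2, 1), (0, 0, 0)))    # None
```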


Worked Exercise A19 


(a) Let P and Q be the points with position vectors p = (1, 2,3) and 
q = (3, —2, 1), respectively. Find the vector form of the equation of 
the line l through P and Q. 


(b) Determine whether the point (4, −4, 0) lies on the line l. 


Exercise A50 


(a) Let P and Q be the points with position vectors p = (2,1,0) and 
q = (1,0, —1), respectively. Find the vector form of the equation of 
the line l through P and Q. 


(b) Determine the points on l whose position vectors are given by the 
equation you found in part (a) when λ takes the values 5 and −1. 


4.4 Scalar product 


In this subsection you will meet a way of combining two vectors, known as 
the scalar product or dot product, which is useful in linear algebra, as you 
will see in Book C. 


The definition of the scalar product is given below. It applies to vectors in 
both R² and R³. 


Definition 
If u and v are non-zero vectors in R² or R³, then the scalar product 
(or dot product) of u and v is 

u · v = |u||v| cos θ, 

where θ is the angle between u and v. 


If one or both of u and v is the zero vector, then u · v = 0. 


The scalar product of two vectors is a scalar, hence the name. 


Note that the angle between two vectors is defined to be the angle θ in the 
range 0 ≤ θ ≤ π between their directions when the vectors are placed to 
have the same starting point (not necessarily the origin), as illustrated in 
Figure 64 for vectors in R² and R³. 


Figure 64 The angle θ between two vectors u and v in (a) R² and (b) R³ 


Let us use the definition of the scalar product to calculate the scalar 
product u · v of the vectors u = (2, 0) and v = (3, 3) in R², which are 
shown in Figure 65. We have 

|u| = 2 

and 

|v| = √(3² + 3²) = √(2 × 3²) = 3√2. 

The angle θ between the vectors u and v is π/4. Hence 

u · v = |u||v| cos θ 
      = 2 × 3√2 × cos(π/4) 
      = 6√2 × (1/√2) 
      = 6. 


Figure 65 The vectors u = (2, 0) and v = (3, 3) 
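The calculation above can be reproduced numerically from the definition. This Python sketch assumes the magnitudes and the angle are already known, as they are in the example; the function name is a hypothetical choice. Floating-point arithmetic gives the value 6 only up to rounding error, so the result is rounded for display.

```python
import math

def scalar_product_from_angle(size_u, size_v, theta):
    """Scalar product |u||v| cos(theta) of two non-zero vectors, given
    their magnitudes and the angle theta (in radians) between them."""
    return size_u * size_v * math.cos(theta)

size_u = 2.0                 # |u| for u = (2, 0)
size_v = math.hypot(3, 3)    # |v| = 3 * sqrt(2) for v = (3, 3)
theta = math.pi / 4          # angle between u and v

value = scalar_product_from_angle(size_u, size_v, theta)
print(round(value, 12))      # 6.0
```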


Figure 66 A vector v 


Figure 67 Perpendicular vectors u and v 


There is an easier way to calculate the scalar product of two vectors, which 
does not depend on knowing the angle between them, but just involves 
their components. You will meet this method shortly, but first we will use 
the definition of the scalar product to derive some of its properties. 


To start with, consider any vector v in R² or R³, as illustrated in 
Figure 66. Let us find the scalar product of v with itself. The angle 
between v and itself is 0, so we have 

v · v = |v||v| cos 0 = |v|² × 1 = |v|². 


This gives the following property. 


Magnitude of a vector in terms of scalar product 


For any vector v in R² or R³, 

|v| = √(v · v). 


Now consider any two vectors u and v in R² or R³ that are at right angles 
to each other, as illustrated in Figure 67. Their scalar product is 

u · v = |u||v| cos(π/2) = |u||v| × 0 = 0. 

So the scalar product of any two perpendicular vectors is 0. 


A converse of this result also holds. Suppose that u and v are vectors in 
R² or R³ whose scalar product is 0. Then, by the definition of the scalar 
product, 

|u||v| cos θ = 0, 

where θ is the angle between u and v. It follows that 

|u| = 0 or |v| = 0 or cos θ = 0, 

and hence 

u = 0 or v = 0 or θ = π/2. 

So we have the following property. 


Scalar product and perpendicularity 
Let u and v be vectors. 
• If u and v are perpendicular, then u · v = 0. 
• If u · v = 0, then u = 0, or v = 0, or u and v are perpendicular. 


Finally, the scalar product has the following algebraic properties. 


Algebraic properties of scalar product 


Let u, v and w be vectors in R² or R³, and let a ∈ R. The following 
properties hold. 

Commutativity u · v = v · u 

Multiples (au) · v = u · (av) = a(u · v) 

Distributivity u · (v + w) = u · v + u · w, 
(u + v) · w = u · w + v · w. 


Note that the distributive properties in the box also hold if the plus signs 
are replaced by minus signs. This follows by combining the distributive 
properties with the multiples property for a = —1. 


The properties of the scalar product in the box can be proved by using the 
definition of the scalar product. 


The commutative property follows immediately from the definition. 


To see why the multiples property holds, let u and v be two vectors in R² 
or R³, and first suppose that a is a positive constant. If the angle between 
u and v is θ, then the angle between au and v is also θ, as illustrated in 
Figure 68, so 

(au) · v = |au||v| cos θ 
         = |a||u||v| cos θ 
         = a|u||v| cos θ (since a is positive) 
         = a(u · v), 

and, similarly, 

u · (av) = |u||av| cos θ 
         = |u||a||v| cos θ 
         = a|u||v| cos θ 
         = a(u · v). 

Figure 68 Vectors u and v, and a scalar multiple au of u, where a is positive 

The multiples property can be proved in the case where a is negative in a 
similar way. In this case the angle between au and v, and also the angle 
between u and av, is π − θ, but cos(π − θ) = −cos θ by the properties of 
the cosine function (see the module Handbook). 


The proof of the distributive properties is more complicated, and the 
details are omitted here. 




Using the properties of the scalar product given above, we can prove the 
following simple formulas for calculating the scalar product. 


Scalar product of vectors in component form 


In R², let u = (x1, y1) and v = (x2, y2). Then 

u · v = x1x2 + y1y2. 

In R³, let u = (x1, y1, z1) and v = (x2, y2, z2). Then 

u · v = x1x2 + y1y2 + z1z2. 


Here is a proof of the formula above for vectors in R². The proof for 
vectors in R³ is similar, but longer. 


Let u and v be vectors in R². We write them in component form as 
u = x1i + y1j and v = x2i + y2j, 
as shown in Figure 69. 


Figure 69 The vectors u and v in component form 


This gives 
u · v = (x1i + y1j) · (x2i + y2j) 
      = (x1i + y1j) · x2i + (x1i + y1j) · y2j (by distributivity) 
      = x1i · x2i + y1j · x2i + x1i · y2j + y1j · y2j (by distributivity) 
      = x1x2 i · i + y1x2 j · i + x1y2 i · j + y1y2 j · j 
        (by the multiples property). 


Now i and j have magnitude 1, so by the formula for the magnitude of a 
vector in terms of scalar product, given earlier, 

i · i = 1² = 1 and j · j = 1² = 1. 

Also, i and j are perpendicular, so 
i · j = j · i = 0. 

Hence 
u · v = x1x2 × 1 + y1x2 × 0 + x1y2 × 0 + y1y2 × 1 
      = x1x2 + y1y2, 
as claimed. 
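The component formula is easy to express in code. The following Python sketch assumes vectors are tuples with the same number of components; the name dot is an illustrative choice. It also checks the facts about i and j used in the proof, and recovers a magnitude from the scalar product of a vector with itself.

```python
import math

def dot(u, v):
    """Scalar product: the sum of products of corresponding components."""
    return sum(a * b for a, b in zip(u, v))

i, j = (1, 0), (0, 1)

print(dot(i, i), dot(j, j))   # 1 1  (unit vectors)
print(dot(i, j))              # 0    (perpendicular vectors)

# The example calculated earlier from the definition:
print(dot((2, 0), (3, 3)))    # 6

# Magnitude from the scalar product: |v| is the square root of v . v
print(math.sqrt(dot((3, 4), (3, 4))))  # 5.0
```

The same function handles triples, giving x1x2 + y1y2 + z1z2.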




Worked Exercise A20 


Calculate the following scalar products. 
(a) (3,3)-(2,0) (b) (21+3j)- (21-5) (c) (V2, -4) - (2V2,1) 
(d) (1; =; 1) ° (1, =l; 1) 


Worked Exercise A20(a) is the particular scalar product that was 
calculated using the original definition near the start of this subsection. 


Notice that the result of Worked Exercise A20(c) shows that the vectors 
(√2, −4) and (2√2, 1) are perpendicular, something that is not 
immediately obvious when we look at their component forms. 


Exercise A51 


Calculate the following scalar products. 
(a) (2,3)*(8,-4) —(b) (1,4)*(2,-4) (©) (28 + 5) - (BE - 23) 
(da) (1,-—1, —2) - (3, —2,—5) 


One useful application of the scalar product is that it provides a method 
for finding the angle between two vectors, as illustrated in Figure 70. The 
formula below is obtained by rearranging the original definition of the 
scalar product. 


Angle between two vectors 


The angle θ between two vectors u and v is given by 

cos θ = (u · v) / (|u||v|). 
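As a sketch of how this formula might be used in practice, the Python function below computes the angle in radians from the components. It assumes both vectors are non-zero (the formula divides by their magnitudes), and it clamps the cosine to the interval [−1, 1] to guard against rounding error; the function name angle_between is hypothetical.

```python
import math

def angle_between(u, v):
    """Angle in radians, in the range 0 to pi, between non-zero u and v."""
    dot = sum(a * b for a, b in zip(u, v))
    size_u = math.sqrt(sum(a * a for a in u))
    size_v = math.sqrt(sum(b * b for b in v))
    if size_u == 0 or size_v == 0:
        raise ValueError("the angle is not defined for the zero vector")
    cos_theta = dot / (size_u * size_v)
    # Rounding can push the value slightly outside [-1, 1].
    cos_theta = max(-1.0, min(1.0, cos_theta))
    return math.acos(cos_theta)

# The vectors u = (2, 0) and v = (3, 3) met earlier: the angle is pi/4.
print(round(angle_between((2, 0), (3, 3)), 4))   # 0.7854
```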


Figure 70 Two vectors u and v, and the angle θ between them 


Figure 71 The vectors u = (4, −2) and v = (9, 3) 


Figure 72 The vectors u = (3, 1, −1) and v = (1, 2, 3) 


In the next worked exercise this formula is used to find the angle between 
two vectors in R². 


Worked Exercise A21 


Find the angle θ between the vectors u = (4, −2) and v = (9, 3), in 
radians. (These vectors are shown in Figure 71.) 


Exercise A52 


Find the angle between the vectors in each of the following pairs of vectors, 
in radians. Give your answer to two decimal places unless it is an obvious 
multiple of π. 


(a) (1,4), (5,2) (b) (—2,2), (1,-1) (e) 91 — 2j, i+ 2j 


The formula for the angle between two vectors works equally well in R³, as 
is shown in the next worked exercise. 


Worked Exercise A22 


Find the angle θ between the vectors u = (3, 1, −1) and v = (1, 2, 3), in 
radians to two decimal places. (These vectors are shown in Figure 72.) 


Exercise A53 


Find the angle between the following pairs of vectors, in radians to two 
decimal places. 


(a) (3, 4, 5), (1, 0, −1) 
(b) 2j − 3k, −i − j − 2k 


4.5 Equation of a plane in R³ 


In Subsection 1.1 you saw that the general form of the equation of a line in 
R² is ax + by = c, where a, b, c ∈ R, and a and b are not both zero. We can 
use the scalar product to derive a similar general form for the equation of a 
plane in R³, as you will see in this subsection. In doing this, we will also 
derive a general form for the equation of a plane in R³ in terms of vectors. 


First, let us look at some planes in R³ whose equations are easy to find. 
The ‘simplest’ planes in R³ are the three planes that contain a pair of axes. 
The (x, y)-plane is the plane that contains the x- and y-axes, as 
illustrated in Figure 73. The (x, z)-plane and the (y, z)-plane are defined 
similarly. The points that lie in the (x, y)-plane are the points (x, y, z) in 
R³ for which z = 0, so the equation of the (x, y)-plane is 


z = 0. 


Figure 73 The (x, y)-plane 


Figure 75 Parallel planes 


Exercise A54 


Write down the equations of the (y, z)-plane and the (x, z)-plane. 


Exercise A55 


Sketch the planes whose equations are as follows. 
(a) z=2 (b) y=-1 


Before we derive the general equation of a plane in R³, we need the 
following concept. 


Definition 
A vector that is perpendicular to all the vectors in a particular plane 


is called a normal vector (or simply a normal) to the plane. Its 
direction is said to be normal to the plane. 


Figure 74(a) shows some normal vectors to a plane. If n is a normal vector 
to a particular plane, then so is kn, for any non-zero real number k. If 

k > 0, then kn is in the same direction as n, whereas if k < 0, then kn is 
in the opposite direction to n, as illustrated in Figure 74(b). 


Figure 74 Some normal vectors to a plane: (a) several normals; (b) multiples kn of a normal n 


Any non-zero vector n in R³ is a normal vector to infinitely many planes, all 
parallel to each other, as illustrated in Figure 75. 


We can specify any particular plane in R³ by specifying a normal vector to 
the plane, together with a point that lies in the plane. For example, there 
is exactly one plane that contains the point P(2, 3, 4) and has 
n = (1, 2, −1) as a normal. 


Here is how we can find an equation for this particular plane. A condition 
for an arbitrary point X(x, y, z) in R³ to lie in the plane is that the vector 
PX must be perpendicular to the normal vector n, as illustrated in 
Figure 76. In other words, we must have 

PX · n = 0. 

Now 
PX = x − p (where x and p are the position vectors of X and P) 
   = (x, y, z) − (2, 3, 4) 
   = (x − 2, y − 3, z − 4). 
Hence the condition for the point X(x, y, z) to lie in the plane is 
(x − 2, y − 3, z − 4) · (1, 2, −1) = 0, 
that is, 
(x − 2) × 1 + (y − 3) × 2 + (z − 4) × (−1) = 0, 
which simplifies to 
x + 2y − z = 4. 
This is the equation of the plane. 

In fact every plane in R³ has an equation of the form 
ax + by + cz = d, 

for some real numbers a, b, c and d. To prove this, we apply the argument 
above to a general plane. Consider the plane that contains the point 
P(x1, y1, z1) and has n = (a, b, c) as a normal vector, as illustrated in 
Figure 77. 


Figure 76 An arbitrary point X(x, y, z) on the plane containing the point 
P(2, 3, 4) with normal n = (1, 2, −1) 


Figure 77 An arbitrary point X(x, y, z) on the plane containing the point 
P(x1, y1, z1) with normal n = (a, b, c) 


A condition for an arbitrary point X(x, y, z) in R³ to lie in this plane is 
that the vectors PX and n must be perpendicular, that is, 

PX · n = 0. 

Since PX = x − p, where x and p are the position vectors of X and P, 
respectively, this condition can be written as 

(x − p) · n = 0. 

By the algebraic properties of the scalar product, we can write the 
condition as 

x · n − p · n = 0, 

that is, 

x · n = p · n. 

This is the vector form of the equation of the plane. Alternatively, we can 
write it in terms of the coordinates x, y and z, by substituting for 
x = (x, y, z), n = (a, b, c) and p = (x1, y1, z1). Then the equation becomes 

(x, y, z) · (a, b, c) = (x1, y1, z1) · (a, b, c), 

that is, 

ax + by + cz = ax1 + by1 + cz1. 

This equation is of the form 

ax + by + cz = d, 

where d is the real number given by d = ax1 + by1 + cz1. So we have shown 
that every plane in R³ has an equation of this form, for some real numbers 
a, b, c and d. 


Equation of a plane in R³ 
The equation of the plane that contains the point (x1, y1, z1) and has 
the vector n = (a, b, c) as a normal is 
ax + by + cz = d, 
where d = ax1 + by1 + cz1. 


This equation can be written in vector form as 
x · n = p · n, 

where x = (x, y, z) and p = (x1, y1, z1). 


Once we know the equation of a plane in the form ax + by + cz = d, we can 
‘read off’ the components of a normal vector, as they are the coefficients of 
x, y and z in the equation. For instance, one normal to the plane with 
equation x − 2y + 3z = 7 is n = (1, −2, 3). Note that the zero vector can 
never be a normal since its direction is undefined. 
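The boxed result can be turned into a short calculation. The Python sketch below assumes a plane is represented by the tuple of coefficients (a, b, c, d) of the equation ax + by + cz = d, which is a representation chosen here only for illustration; the function names are hypothetical. It uses the point P(2, 3, 4) and normal n = (1, 2, −1) from the example in this subsection.

```python
def plane_from_point_and_normal(p, n):
    """Coefficients (a, b, c, d) of the plane ax + by + cz = d that
    contains the point p and has the non-zero vector n as a normal."""
    if n == (0, 0, 0):
        raise ValueError("the zero vector cannot be a normal")
    a, b, c = n
    d = sum(ni * pi for ni, pi in zip(n, p))   # d = p . n
    return (a, b, c, d)

def lies_in_plane(x, plane):
    """Test whether the point x satisfies ax + by + cz = d."""
    a, b, c, d = plane
    return a * x[0] + b * x[1] + c * x[2] == d

def normal_of(plane):
    """Read off a normal vector from the coefficients of x, y and z."""
    return plane[:3]

plane = plane_from_point_and_normal((2, 3, 4), (1, 2, -1))
print(plane)                             # (1, 2, -1, 4), that is, x + 2y - z = 4
print(lies_in_plane((2, 3, 4), plane))   # True
print(lies_in_plane((0, 0, 0), plane))   # False
print(normal_of((1, -2, 3, 7)))          # (1, -2, 3)
```

The exact equality test is suitable for integer coordinates; with floating-point coordinates a tolerance would be needed.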


When we want to find the equation of a plane, it is simpler to start from 
the vector form of the equation, as demonstrated in the next worked 
exercise. 




Worked Exercise A23 


Determine the equation of the plane in R³ that contains the point 
(1, −1, 4) and has the vector (2, −2, 3) as a normal. 


Exercise A56 


Determine the equation of each of the following planes. 


(a) The plane that contains the point (1, 0, 2) and has the vector (2, 3, 1) 
as a normal. 


(b) The plane that contains the point (−1, 1, 5) and has the 
vector (4, −2, 1) as a normal. 


In Book C you will see how you can find the equation of a plane in R³ if 
you know three points on the plane, rather than a point and a normal. 


Summary 


In this unit you have studied some fundamental ideas in mathematics. You 
have met a new notation for specifying sets and encountered examples of 
sets of numbers and sets of points. You have studied the operations of 
union, intersection and difference that can be performed on sets, and seen 
how to show that two sets are equal. You have also met many examples of 
functions between sets, and seen that a one-to-one function has an inverse. 
Finally, you have worked with vectors and seen how to carry out vector 
arithmetic in component form and use the scalar product of two vectors. 


Throughout the unit you have worked especially with the sets R, R² 
and R³, of real numbers, ordered pairs of real numbers and ordered triples 
of real numbers, respectively. You have seen that the elements of these sets 
can be regarded geometrically as points on the real line, in the plane and 
in space, and that points in R² or R³ can also be identified with their 
position vectors. 


You will continue your study of foundational mathematical concepts in the 
rest of Book A, and the ideas you meet here will be in constant use 
throughout this module. 


Learning outcomes 


After working through this unit, you should be able to: 

• recognise the equation of a line and the equation of a circle in R² 

• use set notation and the notation of intervals of the real line 

• determine whether one set is a subset of another, and whether two sets 
are equal 

• find the union, intersection and difference of two sets 

• define a function and its domain, codomain and rule 

• determine the image set of a function 

• determine whether a function is one-to-one and/or onto 

• find the inverse of a one-to-one function, and the composite of two 
functions 

• explain what are meant by a vector, a scalar, a scalar multiple of a 
vector, and the sum and difference of two vectors 

• represent vectors in R² and R³ in terms of their components, and carry 
out vector arithmetic using components 

• determine the equation of a line in R² or R³ in terms of vectors 

• explain what is meant by the scalar product of two vectors, and use it to 
find the angle between two vectors 

• recognise the equation of a plane in R³, and the vector form of the 
equation 

• determine the equation of a plane in R³, given a point in the plane and a 
normal to the plane. 


Solutions to exercises 


Solution to Exercise A1 


Using the formula for the equation of a line when 
given its gradient and one point on it, we find that 
the equation of this line is 

y − (−1) = −3(x − 2). 
We can rearrange this to 
y = −3x + 5, 
or 
3x + y = 5. 


Solution to Exercise A2 


(a) Since (1, 1) and (3, 5) lie on the line, its 
gradient is 
m = (1 − 5)/(1 − 3) = (−4)/(−2) = 2. 
Then, since the point (1, 1) lies on the line, its 
equation must be 
y − 1 = 2(x − 1), 
so 
y = 2x − 1, or 2x − y = 1. 

(b) Both these points have x-coordinate 0, so they 
lie on the line with equation x = 0, the y-axis. 

(c) Since the origin lies on the line, its equation 
must be of the form y = mx, where m is its 
gradient. 
Since (4, 2) lies on the line, its coordinates must 
satisfy the equation of the line. Thus 2 = 4m, so 
m = ½. 
Hence the equation of this line is y = ½x, or 
½x − y = 0, or x − 2y = 0. 

(d) Both these points have y-coordinate −1, so 
they lie on the line with equation y = −1. 


Solution to Exercise A3 


We can rearrange the equations of the lines to find 
their gradients as follows: 


l: y =—2r + 4 b: y=2x+$ 
l3: y=-—4r+5 l: y=4r- 5 
ls: y=4r+1 Ig: y=- 


Thus the gradients of the given lines are −2, 2, 
−½, ½, ½ and −2, respectively. 


It follows that the lines l1 and l6 are parallel, since 
their gradients are the same but their y-intercepts 
are different. Similarly, l4 and l5 are also parallel. 


Lines l1 and l4 are perpendicular, since the product 
of their gradients is −1. For the same reason, each 
of the following pairs of lines are perpendicular: 
l1 and l5; l2 and l3; l4 and l6; and l5 and l6. 


Solution to Exercise A4 


We use the formula for the distance between two 
points in the plane. This gives the following 
distances. 


(c) /(—1?+0-2)=Vir7 


(d) √((−1 − 3)² + (4 − (−8))²) = √160 = 4√10 
(The two points in part (a) are on the x-axis, so in 
fact there is no need to use the distance formula to 
find the distance between them.) 


Solution to Exercise A5 
(a) This circle has equation 
(x − 0)² + (y − 0)² = 4², 
which can be simplified to give 
x² + y² = 16. 
(b) This circle has equation 
(x − (−1))² + (y − 0)² = (√2)², 
which can be simplified to give 
(x + 1)² + y² = 2. 
(c) This circle has equation 
(x − 3)² + (y − (−4))² = 2², 
which can be simplified to give 
(x − 3)² + (y + 4)² = 4. 




Solution to Exercise A6 


[Figure: three-dimensional axes, with the point (0, 1, 2) marked] 


Solution to Exercise A7 


We use the formula for the distance between two 
points in R³. This gives the following distances. 


(a) √((4 − 1)² + (1 − 1)² + (−3 − 1)²) 
= √(9 + 0 + 16) = 5 
(b) «/G= 12+ 0— 
= /44+440 =2V2 


Solution to Exercise A8 

(a) True: —3 is an integer. 

(b) False: 5 is a natural number. 

(c) False: 1.3 is the rational number 13/10. 

(d) True: both 1 and 3 are rational numbers. 
(e) True: —7 is a real number. 

(f) False: 5 is not a natural number. 


(g) False: 1 is a non-zero real number, but 0 is 
not. 


(h) False: √2 is a real number. 


Solution to Exercise A9 

(a) True: 1 is a member of the given set. 

(b) True: the set {—9} is a member of the given 
set, although the number —9 is not. 

(c) False: the number 9 belongs to the given set, 
but the set {9} does not. 


(d) False: the point (0, 1) is not a member of the 
given set of points in R², although the point (1, 0) 
is. 

(e) False: the numbers 1 and 0 are not members 
of the given set of points in R², although the point 
(1, 0) is. 


(f) True: the set {1, 0} is the same as the set 
{0, 1}, and so is a member of the given set. Notice 
that the members of this set are themselves sets, 
and not points in R². 


Solution to Exercise A10 
(a) True: > is in R, and it satisfies the condition 
T> 3. 


(b) True: 7 = 3 x 2+ 1, so 7 is of the form 3k + 1 
for some k € Z. 

(c) False: — is not in Z. 

(d) False: 8 cannot be expressed as 2^x for some 
number x ∈ R satisfying 0 < x < 2; in fact 8 = 2³. 


(e) True: 9 is in Z, and 9 = 3², so 9 = k² for some 
k ∈ Z. 


(f) True: 6 = 3(3 − 1), so 6 is of the form 
m(m − 1) for some m ∈ N. 


(g) False: 4 is an even integer, but it does not 
satisfy 0< r < 4. 


Solution to Exercise A11 
(a) {k ∈ Z : −2 < k < 1000} 


(b) {x ∈ Q : x > 0 and x² > 2} or 
{x ∈ Q : x > 0, x² > 2} 


(c) {2n : n ∈ N} 
(d) {2^k : k ∈ Z} 


Solution to Exercise A12 


(a) False: the set (1,5) is an open interval and 
does not include the endpoint 1. 


(b) True: the set (—1, 1] is half-open, with the 
upper endpoint 1 included. 


(c) False: ∞ does not denote a number and so is 
not in the interval. 


(d) True: R* denotes the set of non-zero real 
numbers, so 0 is not a member of this set. 


(e) False: x ∈ R* means x is a non-zero real 
number, while (0, ∞) comprises just the positive 
real numbers. For example, the number −1 is in 
R*, but not in (0, ∞). 


Solution to Exercise A13 
(a) [−11, 2) 

(b) (−6.5, 21] 

(c) (−273, ∞) 


Solution to Exercise A14 
(a) l = {(x, y) ∈ R² : y = 2x + 5} 


(There are other ways to specify this line; another 
example is l = {(x, 2x + 5) : x ∈ R}.) 


(b) The line l has equation y = 1 − x, so it is as 
follows. 


(c) The line l has equation y = x (since here 
m = 1 and c = 0), so it is as follows. 


Solution to Exercise A15 
(a) C = {(x, y) ∈ R² : (x − 1)² + (y + 4)² = 9} 
(b) The circle has centre (1, 3) and radius 2. 


Solution to Exercise A16 


(a) This set is a half-plane with the boundary line 
excluded, as follows. 




(b) This set is another half-plane, but this time 
the boundary line is included, as follows. 




(c) This set is a disc with the boundary excluded, 
as follows. 


(d) This set consists of the points outside a disc 
with centre (0, −3) and radius 1, as follows. 


Solution to Exercise A17 
{(x, y) ∈ R² : 0 < x < 2, 1 < y < 3} 


Solution to Exercise A18 


(a) The set B consists of the solutions of the 
equation 


r’ +r—6=0, 
which we can write as 

(x — 2)(x +3) =0. 
So B = {2,-3} =A. 


90 


(b) The two sets are
A = {k ∈ Z : k is odd and 0 < k < 8}
= {1, 3, 5, 7},
B = {2n + 1 : n ∈ N and n² < 25}
= {3, 5, 7, 9}.
Hence A ≠ B, either because 9 ∈ B but 9 ∉ A, or
because 1 ∈ A but 1 ∉ B.
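
The two sets in part (b) can also be listed mechanically. The following Python sketch is an illustration only: it assumes that N denotes the positive integers 1, 2, 3, ... and that both inequalities are strict, as printed above.

```python
# Enumerate the sets A and B of part (b) (illustrative sketch).
# Assumption: N is the set of positive integers 1, 2, 3, ...

# A = {k in Z : k is odd and 0 < k < 8}
A = {k for k in range(1, 8) if k % 2 == 1}

# B = {2n + 1 : n in N and n^2 < 25}; n^2 < 25 forces n <= 4,
# so searching n = 1, ..., 9 is more than enough.
B = {2 * n + 1 for n in range(1, 10) if n * n < 25}

print(sorted(A))     # [1, 3, 5, 7]
print(sorted(B))     # [3, 5, 7, 9]
print(A == B)        # False
print(A - B, B - A)  # {1} {9}
```

The last line prints the elements that witness A ≠ B: 1 lies in A only, and 9 lies in B only.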


Solution to Exercise A19 


(a) Each element of A is a point in R².

We calculate x − 4y using the coordinates of each
point of A:
5 − 4 × 2 = −3,
1 − 4 × 1 = −3,
−3 − 4 × 0 = −3.

This shows that each element of A is an element of
B, so A ⊆ B.


(b) The sets A and B are sketched below.

[Figure: sketch of the sets A and B.]

The set A is the interior of the unit circle, and B is
the half-plane consisting of all points with negative
y-coordinate. So A ⊈ B, because, for example, the
point (1/2, 1/2) belongs to A but not to B. (Any one
point that is in set A but not in set B shows that
A ⊈ B.)

(c) Let x be an arbitrary element of A; then x ∈ R
and satisfies −1 < x < 0. This inequality gives

−1 + 1 < x + 1 < 0 + 1,
that is,

0 < x + 1 < 1.
Hence

0 < (x + 1)² < 1,
so x ∈ B.


Since x is an arbitrary element of A, we conclude
that A ⊆ B.


Solution to Exercise A20 


(a) We showed that A ⊆ B in the solution to
Exercise A19(a). Also, for example, the point (9, 3)
lies in B, since

9 − 4 × 3 = −3,
but does not lie in A. Therefore A is a proper
subset of B.


(b) We showed that A ⊆ B in the solution to
Exercise A19(c). Also, for example, −2 lies in B,
since

(−2 + 1)² = (−1)² = 1,

but does not lie in A. Therefore A is a proper
subset of B.


Solution to Exercise A21 
(a) First we show that A ⊆ B.
Let (x, y) ∈ A; then (x, y) ∈ R², and for some
t ∈ R, we have x = t² and y = 2t. Hence
y² = (2t)² = 4t² = 4x.
So (x, y) ∈ B, and A ⊆ B.
Next we show that B ⊆ A.

Let (x, y) ∈ B; then y² = 4x. We must show that
there is a value of t in R such that x = t² and
y = 2t, so that (x, y) ∈ A. Let t be given by y = 2t;
that is, t = y/2. Then, since 4x = y², we have
x = y²/4, and substituting for y gives

x = (2t)²/4 = t².

Hence (x, y) = (t², 2t) ∈ A, and so B ⊆ A.
Since A ⊆ B and B ⊆ A, it follows that A = B.
(b) First we show that A ⊆ B.
Let (x, y) ∈ A; then 2x + y − 3 = 0. We must show
that there is a value of t in R such that x = t + 1
and y = 1 − 2t. Let t be given by x = t + 1, that is,
t = x − 1. Then, since 2x + y − 3 = 0, we have
y = 3 − 2x
= 3 − 2(t + 1)
= 1 − 2t.
Hence (x, y) = (t + 1, 1 − 2t) ∈ B, and so A ⊆ B.
Next we show that B ⊆ A.

Let (x, y) ∈ B; then (x, y) ∈ R², and for some
t ∈ R, we have x = t + 1 and y = 1 − 2t. We must
show that (x, y) satisfies 2x + y − 3 = 0. Now

2x + y − 3 = 2(t + 1) + (1 − 2t) − 3
= 0,
as required, so (x, y) ∈ A. Therefore B ⊆ A.
Since A ⊆ B and B ⊆ A, it follows that A = B.


Solution to Exercise A22 
(a) (1, 7) ∪ [4, 11] = (1, 11].


(b) R* denotes the set of non-zero real numbers,
and so is the union of the two intervals (−∞, 0)
and (0, ∞); that is,

R* = (−∞, 0) ∪ (0, ∞).


(c) The union of the half-plane and disc is

[Figure: sketch of the union H ∪ D.]


Solution to Exercise A23 
(a) (1, 7) ∩ [4, 11] = [4, 7).


(b) The intersection is

[Figure: sketch of the intersection H ∩ D.]


Solution to Exercise A24 


(a) (1, 7) − [4, 11] = (1, 4) and
[4, 11] − (1, 7) = [7, 11].




(b) The two differences are

[Figures: sketches of the differences H − D and D − H.]


Solution to Exercise A25 


(a) This is the translation of the plane that moves 
each point to the right by 2 units and up by 
3 units. 


(b) This is the reflection of the plane in the x-axis. 


(c) This is the rotation of the plane through π/2
anticlockwise about the origin.


Solution to Exercise A26 
Only diagram (b) represents a function. 


Diagram (a) does not represent a function, as there 
is no arrow from the element 3. 


Diagram (c) does not represent a function, as there 
are two arrows from the element 1. 


Solution to Exercise A27 

(a) f(S) = {f(0), f(1), f(2), f(3)}
= {−1, 0, 1, 2}.

(b) f(Z) = {..., f(−1), f(0), f(1), f(2), ...}
= {..., −2, −1, 0, 1, ...}
= Z.


Solution to Exercise A28 

The images of the elements of A are
f(0) = 9, f(1) = 8, f(2) = 7, f(3) = 6,
f(4) = 5, f(5) = 4, f(6) = 3, f(7) = 2,
f(8) = 1, f(9) = 0.


So the image set of f is
{0, 1, 2, 3, 4, 5, 6, 7, 8, 9} = A.


Solution to Exercise A29


Only diagram (a) represents an onto function. 


Diagram (b) does not even represent a function, as 
there is no arrow from the element 4. 


Diagram (c) represents a function that is not onto, 
as there is no arrow going to the element 1. 


Solution to Exercise A30 


(a) The sketch of the graph of f below suggests
that f(R) = [1, ∞).

[Figure: graph of y = 1 + x².]

We prove that f(R) = [1, ∞).

Let x ∈ R; then f(x) = 1 + x². Since x² ≥ 0, we
have 1 + x² ≥ 1 and so f(R) ⊆ [1, ∞).

We must show that f(R) ⊇ [1, ∞).

Let y ∈ [1, ∞). We must show that there exists
x ∈ R such that f(x) = y; that is, 1 + x² = y. Now
x = √(y − 1) is real, since y ≥ 1, and satisfies
f(x) = y, as required. (Alternatively, x = −√(y − 1)
is real and satisfies f(x) = y.)

Thus f(R) ⊇ [1, ∞).

Since f(R) ⊆ [1, ∞) and f(R) ⊇ [1, ∞), it follows
that f(R) = [1, ∞), so the image set of f is [1, ∞),
as expected.

The interval [1, ∞) is not the whole of the
codomain R, so f is not onto.
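
The two halves of this argument can be spot-checked numerically. The Python sketch below is only an illustration of the proof, not a substitute for it; the function names and the sample values are chosen here and are not part of the exercise.

```python
import math

def f(x):
    """The function f(x) = 1 + x^2 considered in part (a)."""
    return 1 + x * x

def preimage(y):
    """Return one x with f(x) = y, using the choice x = sqrt(y - 1)."""
    if y < 1:
        raise ValueError("y is not in the image set [1, infinity)")
    return math.sqrt(y - 1)

# Each sampled y >= 1 is attained by f, so it lies in the image set.
for y in [1, 1.5, 2, 10, 1e6]:
    x = preimage(y)
    assert math.isclose(f(x), y)
    assert math.isclose(f(-x), y)  # the alternative choice x = -sqrt(y - 1)

# No sampled value of f falls below 1.
assert all(f(x) >= 1 for x in [-3, -0.5, 0, 0.5, 3])
```

A finite sample cannot prove that the image set is [1, ∞); it only confirms that the formula for x used in the proof behaves as claimed.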




(b) This function is the reflection of the plane in
the x-axis. This suggests that f(R²) = R².


We know that f(R²) ⊆ R², so we must show that
f(R²) ⊇ R².

Let (x′, y′) ∈ R². We must show that there exists
(x, y) ∈ R² such that f(x, y) = (x′, y′);
that is, (x, −y) = (x′, y′), which gives

x = x′, −y = y′.

Rearranging these equations, we obtain

x = x′, y = −y′.

So (x, y) ∈ R² and f(x, y) = (x′, y′), as required.
Thus f(R²) ⊇ R².


Since f(R²) ⊆ R² and f(R²) ⊇ R², it follows that
f(R²) = R², so the image set of f is R²,
as expected.


The codomain of f is also R², so f is onto.


Solution to Exercise A31 
Only diagram (c) represents a one-to-one function. 


Diagram (a) represents a function that is not 
one-to-one, as there are two arrows going to the 
element 3. 


Diagram (b) does not even represent a function, as 
there is no arrow from the element 2. 


Solution to Exercise A32 


(a) This function is not one-to-one since, for 
example, 


f(2) = f(−2) = 1 + 4 = 5.


(b) This function is the reflection of the plane in 
the x-axis, so we expect it to be one-to-one. We 
now prove this algebraically. 


Suppose that f(x₁, y₁) = f(x₂, y₂); then
(x₁, −y₁) = (x₂, −y₂).


This means that x₁ = x₂ and −y₁ = −y₂. It
follows that y₁ = y₂, so we have shown that
(x₁, y₁) = (x₂, y₂); that is, f is one-to-one.


Solution to Exercise A33 


(a) In Exercise A32 we saw that f is not 
one-to-one, so f does not have an inverse function. 


(b) In Exercise A32 we saw that f is one-to-one, 
so f has an inverse function. 


In Exercise A30 we saw that the image set of f is
R² and, for each (x′, y′) ∈ R², we have

(x′, y′) = f(x′, −y′).
So f⁻¹ is the function
f⁻¹ : R² → R²
(x′, y′) ↦ (x′, −y′).

This can be expressed in terms of x and y as

f⁻¹ : R² → R²
(x, y) ↦ (x, −y).

(In this case, f⁻¹ is actually equal to f, which is
what we would expect for a reflection.)
(c) The graph of this function is an upward
sloping straight line, which suggests that it is
one-to-one. First we confirm this algebraically.
Suppose that f(x₁) = f(x₂); then

8x₁ + 3 = 8x₂ + 3,

so 8x₁ = 8x₂, and hence x₁ = x₂. Thus f is
one-to-one, and so it has an inverse function.


We now find the image set of f. We suspect that
its image set is R, so we now prove this
algebraically. Let y be an arbitrary element in R.
We must show that there exists an element x in
the domain R such that

f(x) = y; that is, 8x + 3 = y.

Rearranging this equation, we obtain
x = (y − 3)/8.

This is in R and satisfies f(x) = y, as required.
Thus the image set of f is R.


Hence f⁻¹ is the function
f⁻¹ : R → R
y ↦ (y − 3)/8.

This can be expressed in terms of x as
f⁻¹ : R → R
x ↦ (x − 3)/8.
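
An inverse found by rearrangement can be checked by composing it with the original function in both orders. The Python sketch below assumes the rule f(x) = 8x + 3 used in the working above, and takes as candidate inverse the rule y ↦ (y − 3)/8 obtained by rearranging 8x + 3 = y. The names f and f_inv, and the sample values, are chosen here for illustration. Exact rational arithmetic avoids rounding error.

```python
from fractions import Fraction

def f(x):
    """Assumed rule of the function in part (c): f(x) = 8x + 3."""
    return 8 * x + 3

def f_inv(y):
    """Candidate inverse, from rearranging 8x + 3 = y."""
    return (y - Fraction(3)) / 8

samples = [Fraction(-5), Fraction(0), Fraction(1, 3), Fraction(7, 2)]

# Both composites should return each sample value unchanged.
assert all(f_inv(f(x)) == x for x in samples)
assert all(f(f_inv(y)) == y for y in samples)

print(f_inv(Fraction(11)))  # 1
```

Checking finitely many values does not prove that the two composites are identity functions, but a single failure would show that the candidate rule is wrong.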


Solution to Exercise A34 
The function 
g : [−π/2, π/2] → [−1, 1]
x ↦ sin x
is a restriction of f that is one-to-one.


(There are many other possibilities, for example,
the restriction of the domain to [π/2, 3π/2].)




Solution to Exercise A35 
(a) The rule of g ∘ f is
(g ∘ f)(x) = g(f(x)) = g(−x)
= 3(−x) + 1
= −3x + 1.
Thus g ∘ f is the function
g ∘ f : R → R
x ↦ −3x + 1.
(b) The rule of f ∘ g is
(f ∘ g)(x) = f(g(x)) = f(3x + 1)
= −(3x + 1)
= −3x − 1.
Thus f ∘ g is the function
f ∘ g : R → R
x ↦ −3x − 1.
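
The order of composition matters here, and this is easy to see by evaluating both composites. The Python sketch below assumes the rules f(x) = −x and g(x) = 3x + 1 that appear in the working above. The helper compose is a name introduced here for illustration.

```python
def compose(outer, inner):
    """Return the composite function x -> outer(inner(x))."""
    return lambda x: outer(inner(x))

def f(x):
    return -x

def g(x):
    return 3 * x + 1

g_after_f = compose(g, f)  # g o f: apply f first, then g
f_after_g = compose(f, g)  # f o g: apply g first, then f

for x in [-2, 0, 1, 5]:
    assert g_after_f(x) == -3 * x + 1
    assert f_after_g(x) == -3 * x - 1

print(g_after_f(1), f_after_g(1))  # -2 -4
```

The two composites differ at x = 1, so g ∘ f and f ∘ g are different functions.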


Solution to Exercise A36 
The rule of f ∘ g is
(f ∘ g)(x, y) = f(g(x, y)) = (−x, −y).
Thus f ∘ g is the function
f ∘ g : R² → R²
(x, y) ↦ (−x, −y).
(In this case, f ∘ g = g ∘ f.)


Solution to Exercise A37 
The rule of g ∘ f is
(g ∘ f)(x) = g(f(x)) = g(3x + 1)
= 3/((3x + 1) + 2)
= 1/(x + 1).


The domain of g ∘ f is
{x ∈ [−1, 1] : f(x) ∈ R − {−2}}.


If x ∈ [−1, 1], then f(x) ∈ R − {−2} unless
f(x) = −2. Now f(x) = −2 when

3x + 1 = −2,
that is, when

x = −1.

So the domain of g ∘ f is

[−1, 1] − {−1} = (−1, 1].
Thus g ∘ f is the function
g ∘ f : (−1, 1] → R
x ↦ 1/(x + 1).


Solution to Exercise A38 
The domain of f is R, and for each x ∈ R we have
g(f(x)) = g(5x − 3) = ((5x − 3) + 3)/5 = x;
that is, g ∘ f = i_R.


The domain of g is also R, and for each y ∈ R we
have

f(g(y)) = f((y + 3)/5) = 5((y + 3)/5) − 3 = y;

that is, f ∘ g = i_R.


Since g ∘ f = i_R and f ∘ g = i_R, it follows that g is
the inverse function of f.


Solution to Exercise A39 


This is a translation of the plane that shifts each 
point to the left by 1 unit and up by 3 units, so we 
expect its inverse to shift the plane to the right by 
1 unit and down by 3 units. 
Let 
g : R² → R²
(x, y) ↦ (x + 1, y − 3).
The domain of f is R², and for each (x, y) ∈ R² we
have
g(f(x, y)) = g(x − 1, y + 3)
= (x − 1 + 1, y + 3 − 3)
= (x, y);
that is, g ∘ f = i_{R²}.
The domain of g is also R², and for each
(x, y) ∈ R² we have

f(g(x, y)) = f(x + 1, y − 3)
= (x + 1 − 1, y − 3 + 3)
= (x, y);
that is, f ∘ g = i_{R²}.


Since g ∘ f = i_{R²} and f ∘ g = i_{R²}, it follows that g
is the inverse function of f.


Solution to Exercise A40 


The vector d is in the same direction as a, but 
none of the other vectors is; also, the magnitude of 
d is two-thirds that of a. Hence 


d = (2/3)a and a = (3/2)d.


Next, e is parallel to b but in the opposite 
direction; none of the others is parallel to these two 
vectors. Also, the magnitude of e is three times 
that of b. Hence 


e = −3b and b = −(1/3)e.


Finally, c and f are not multiples of any of the 
other vectors. 


Solution to Exercise A41 


The vector 3d is in the same direction as d, but its
magnitude is three times that of d; the vector −2d
is in the opposite direction to that of d, and its
magnitude is twice that of d.

[Figure: the vectors d, 3d and −2d.]


Solution to Exercise A42 
We use the rule for forming a scalar multiple of a
vector, and the Triangle Law for the addition of
vectors.

[Figure: vector diagrams for the sums and scalar multiples of p and q.]


Solution to Exercise A43 


[Figure: solution diagram.]


Solution to Exercise A44 
(a) Here p = (3, −1) and q = (−1, −2), so
p + q = (3 + (−1), −1 + (−2)) = (2, −3),
−q = (1, 2),
p − q = (3 − (−1), −1 − (−2)) = (4, 1).
(b) Here p = −i − 2j and q = 2i − j, so
p + q = (−1 + 2)i + (−2 + (−1))j = i − 3j,
−q = −2i + j,
p − q = (−1 − 2)i + (−2 − (−1))j = −3i − j.
(c) Here p = −i + 2k and q = i − 2j − k, so
p + q = (−1 + 1)i − 2j + (2 − 1)k = −2j + k,
−q = −i + 2j + k,
p − q = (−1 − 1)i − (−2j) + (2 − (−1))k
= −2i + 2j + 3k.




Solution to Exercise A45 
(a) Since p = (3, −1) and q = (−1, −2),
2p = (6, −2),
3q = (−3, −6),
2p − 3q = (9, 4).
The magnitude of q is
|q| = √((−1)² + (−2)²) = √5.
(b) Since p = −i + 2k and q = i − 2j − k,
2p = −2i + 4k,
3q = 3i − 6j − 3k,
2p − 3q = −5i + 6j + 7k.
The magnitude of q is

|q| = √(1² + (−2)² + (−1)²) = √6.
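
Component-form calculations like these are easy to mechanise. The Python sketch below represents a vector as a tuple of components, so the vector −i + 2k becomes (−1, 0, 2). The helper names scale, sub and magnitude are introduced here for illustration.

```python
import math

def scale(c, v):
    """Scalar multiple c*v, computed component by component."""
    return tuple(c * x for x in v)

def sub(u, v):
    """Difference u - v of two vectors of the same dimension."""
    return tuple(a - b for a, b in zip(u, v))

def magnitude(v):
    """Magnitude |v|: square root of the sum of squared components."""
    return math.sqrt(sum(x * x for x in v))

# Part (a): vectors in the plane.
p, q = (3, -1), (-1, -2)
print(sub(scale(2, p), scale(3, q)))               # (9, 4)
print(math.isclose(magnitude(q), math.sqrt(5)))    # True

# Part (b): components with respect to i, j, k.
p3, q3 = (-1, 0, 2), (1, -2, -1)
print(sub(scale(2, p3), scale(3, q3)))             # (-5, 6, 7)
print(math.isclose(magnitude(q3), math.sqrt(6)))   # True
```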


Solution to Exercise A46 
(a) When v = (2, −3), the magnitude of v is

|v| = √(2² + (−3)²) = √(4 + 9) = √13,

so
v̂ = (1/|v|)v = (1/√13)(2, −3) = (2/√13, −3/√13).
(b) When v = 5i + 12j, the magnitude of v is

|v| = √(5² + 12²) = √(25 + 144) = 13,

so
v̂ = (1/|v|)v = (5/13)i + (12/13)j.
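
Dividing a non-zero vector by its magnitude gives a vector of magnitude 1 in the same direction. The Python sketch below carries out that step in floating-point arithmetic, using the two vectors (2, −3) and 5i + 12j of this exercise. The function name unit_vector is chosen here for illustration.

```python
import math

def unit_vector(v):
    """Return v divided by its magnitude; v must be non-zero."""
    length = math.sqrt(sum(x * x for x in v))
    if length == 0:
        raise ValueError("the zero vector has no direction")
    return tuple(x / length for x in v)

# Part (b): v = 5i + 12j has magnitude 13.
v_hat = unit_vector((5, 12))
assert v_hat == (5 / 13, 12 / 13)

# Part (a): v = (2, -3); the result should have magnitude 1.
w_hat = unit_vector((2, -3))
assert math.isclose(math.hypot(*w_hat), 1.0)
```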


Solution to Exercise A47 
(a) Since p = (5, 3) and q = (1, 4),
p − q = (4, −1),
p + q = (6, 7),
(1/2)p + (1/2)q = (5/2, 3/2) + (1/2, 2) = (3, 7/2).




Solution to Exercise A48 


(a) The vector form of the equation of l is
r = (1 − λ)(3, 1) + λ(2, 3)
= (3 − λ, 1 + 2λ).
(b) Using the formula above with λ = 1/2, 3/2 and −1/2
in turn, we obtain the following position vectors:
r₁ = (3 − 1/2, 1 + 1) = (5/2, 2),
r₂ = (3 − 3/2, 1 + 3) = (3/2, 4),
r₃ = (3 − (−1/2), 1 + (−1)) = (7/2, 0).
Thus the three points on the line are the points
R₁, R₂ and R₃, with coordinates (5/2, 2), (3/2, 4) and
(7/2, 0), respectively.


(c) [Figure: the line l with the points R₁, R₂ and R₃ marked.]


Solution to Exercise A49 

(a) The vector form of the equation of l is
r = (3 − λ, 1 + 2λ).

Hence at the point (4, −1) on l, we have
(4, −1) = (3 − λ, 1 + 2λ).


Equating the corresponding components gives

4 = 3 − λ and −1 = 1 + 2λ.


The first equation gives λ = −1, and this value of
λ also satisfies the other equation. Hence the value
of λ corresponding to the point (4, −1) in the
vector form of the equation of l is λ = −1.


(b) The point (1/2, 1/2) lies on l if and only if there is
some real number λ for which

(1/2, 1/2) = (3 − λ, 1 + 2λ).
Equating corresponding components gives

3 − λ = 1/2 and 1 + 2λ = 1/2.


The first of these equations has solution λ = 5/2, and
the second has solution λ = −1/4.


It follows that there is no real number λ that
satisfies the vector form of the equation of l when
r = (1/2, 1/2); so the point (1/2, 1/2) does not lie on l.
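
The method used in this exercise is to solve one component equation for λ and then test that value in the other. It can be written as a short procedure. The Python sketch below assumes the vector form r = (3 − λ, 1 + 2λ) found in Exercise A48(a). The function name parameter_on_line is introduced here, and the test point (0, 0) is chosen purely for illustration.

```python
from fractions import Fraction

def parameter_on_line(point):
    """Return lambda with (3 - lambda, 1 + 2*lambda) == point, or None.

    Assumes the line r = (3 - lambda, 1 + 2*lambda).
    """
    x, y = point
    lam = 3 - Fraction(x)            # first component: x = 3 - lambda
    if 1 + 2 * lam == Fraction(y):   # does it fit the second component?
        return lam
    return None

print(parameter_on_line((4, -1)))  # -1
print(parameter_on_line((0, 0)))   # None
```

A return value of None means that the two component equations give different values of λ, so the point is not on the line.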


Solution to Exercise A50 
(a) The vector form of the equation of the line l is
r = (1 − λ)(2, 1, 0) + λ(1, 0, −1)
= (2 − λ, 1 − λ, −λ).
(b) Using the formula above with λ = 1/2 and −1,
we obtain the following position vectors:

r₁ = (2 − 1/2, 1 − 1/2, −1/2)
= (3/2, 1/2, −1/2),

r₂ = (2 − (−1), 1 − (−1), −(−1))
= (3, 2, 1).

Thus the two points have coordinates (3/2, 1/2, −1/2)
and (3, 2, 1).

Solution to Exercise A51 


We use the formula for the scalar product of 
vectors in component form. 


(a) (2, 3) · (5/2, −4) = 2 × 5/2 + 3 × (−4)
= 5 − 12 = −7

(b) (1, 4) · (2, −1/2) = 1 × 2 + 4 × (−1/2)
= 2 − 2 = 0

(c) (−2i + j) · (3i − 2j)
= (−2) × 3 + 1 × (−2)
= −6 − 2 = −8
(d) (1, −1, −2) · (3, −2, −5)
= 1 × 3 + (−1) × (−2) + (−2) × (−5)
= 3 + 2 + 10 = 15
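
The component formula for the scalar product is a sum of products of corresponding components, which translates directly into code. The Python sketch below checks parts (c) and (d), writing −2i + j as (−2, 1) and 3i − 2j as (3, −2). The function name dot is chosen here for illustration.

```python
def dot(u, v):
    """Scalar product of two vectors given as tuples of components."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same dimension")
    return sum(a * b for a, b in zip(u, v))

print(dot((-2, 1), (3, -2)))          # -8
print(dot((1, -1, -2), (3, -2, -5)))  # 15
```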




Solution to Exercise A52 


In each case we let u denote the first vector of the
pair, v the second vector, and θ the angle between
the two vectors.


(a) Here
u · v = 13, |u| = √17, |v| = √29.
Hence
cos θ = (u · v)/(|u||v|) = 13/(√17 √29) = 13/√493,
so
θ = cos⁻¹(13/√493)
= 0.95 radians (to 2 d.p.).
(b) Here
u · v = (−2, 2) · (1, −1) = −2 − 2 = −4,
|u| = √((−2)² + 2²) = √(4 + 4) = √8 = 2√2,
|v| = √(1² + (−1)²) = √(1 + 1) = √2.
Hence
cos θ = (u · v)/(|u||v|) = −4/(2√2 √2) = −1,
so

θ = cos⁻¹(−1) = π radians.
You might have expected this result, because u and
v point in opposite directions (in fact, u = −2v).
(c) Here
u · v = (9i − 2j) · (i + 2j)
= 9 × 1 + (−2) × 2
= 9 − 4 = 5,
|u| = √(9² + (−2)²) = √(81 + 4) = √85,
|v| = √(1² + 2²) = √5.

Hence
cos θ = (u · v)/(|u||v|) = 5/(√85 √5) = 1/√17,
so
θ = cos⁻¹(1/√17)
= 1.33 radians (to 2 d.p.).


Solution to Exercise A53 


In each case we let u denote the first vector of the
pair, v the second vector, and θ the angle between
the two vectors.


(a) Here
u · v = (3, 4, 5) · (1, 0, −1)
= 3 × 1 + 4 × 0 + 5 × (−1)
= −2,
|u| = √(3² + 4² + 5²) = √50 = 5√2,
|v| = √(1² + 0² + (−1)²) = √2.

Hence
cos θ = (u · v)/(|u||v|) = −2/(5√2 √2) = −1/5,
so
θ = cos⁻¹(−1/5)
= 1.77 radians (to 2 d.p.).


(b) Here
u · v = (2j − 3k) · (−i − j − 2k)
= 0 × (−1) + 2 × (−1) + (−3) × (−2)
= −2 + 6 = 4,
|u| = √(0² + 2² + (−3)²)
= √(4 + 9) = √13,
|v| = √((−1)² + (−1)² + (−2)²)
= √(1 + 1 + 4) = √6.

Hence
cos θ = (u · v)/(|u||v|) = 4/(√13 √6) = 4/√78,
so
θ = cos⁻¹(4/√78)
= 1.10 radians (to 2 d.p.).


Solution to Exercise A54 


Points (x,y,z) that lie in the (y, z)-plane all have 
x = 0; so x = 0 is the equation of this plane. 


Similarly, points (x,y,z) that lie in the (x, z)-plane 
all have y = 0; so y = 0 is the equation of this 
plane. 


Solution to Exercise A55 


(a) This plane is parallel to the (x, y)-plane and
passes through the point (0, 0, 2).

[Figure: the plane z = 2.]


(b) This plane is parallel to the (x, z)-plane and
passes through the point (0, −1, 0).

[Figure: the plane y = −1.]


Solution to Exercise A56 


We use the formula 
x · n = p · n


for the equation of a plane, where x = (x, y, z), n is
a normal to the plane and p is a point in the plane.


(a) Here n = (2, 3, 1) and p = (1, 0, 2), so the
equation of the plane is

(x, y, z) · (2, 3, 1) = (1, 0, 2) · (2, 3, 1).
This can be expressed in the form

2x + 3y + z = 1 × 2 + 0 × 3 + 2 × 1,
that is,

2x + 3y + z = 4.


(b) Here n = (4, −2, 1) and p = (−1, 1, 5), so the
equation of the plane is

(x, y, z) · (4, −2, 1) = (−1, 1, 5) · (4, −2, 1).
This can be expressed in the form

4x − 2y + z = (−1) × 4 + 1 × (−2) + 5 × 1,

that is,

4x − 2y + z = −1.
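
Since the left-hand side x · n expands to ax + by + cz when n = (a, b, c), the only quantity to compute is the constant d = p · n. The Python sketch below does this for both parts. The function name plane_equation and the convention of returning (a, b, c, d) are choices made here for illustration.

```python
def plane_equation(normal, point):
    """Return (a, b, c, d) for the plane ax + by + cz = d.

    normal is a normal vector n = (a, b, c) to the plane, and
    point is a point p in the plane; d is the scalar product p . n.
    """
    a, b, c = normal
    d = sum(n * p for n, p in zip(normal, point))
    return a, b, c, d

print(plane_equation((2, 3, 1), (1, 0, 2)))    # (2, 3, 1, 4)
print(plane_equation((4, -2, 1), (-1, 1, 5)))  # (4, -2, 1, -1)
```

The outputs correspond to the equations 2x + 3y + z = 4 and 4x − 2y + z = −1.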




